Ruikun Hou


2025

pdf bib
LLM-Human Alignment in Evaluating Teacher Questioning Practices: Beyond Ratings to Explanation
Ruikun Hou | Tim Fütterer | Babette Bühler | Patrick Schreyer | Peter Gerjets | Ulrich Trautwein | Enkelejda Kasneci
Proceedings of the Artificial Intelligence in Measurement and Education Conference (AIME-Con): Full Papers

This study investigates the alignment between large language models (LLMs) and human raters in assessing teacher questioning practices, moving beyond rating agreement to the evidence selected to justify their decisions. Findings highlight LLMs’ potential to support large-scale classroom observation through interpretable, evidence-based scoring, with possible implications for concrete teacher feedback.