Matthew S. Johnson

2025

Leveraging multi-AI agents for a teacher co-design
Hongwen Guo | Matthew S. Johnson | Luis Saldivia | Michelle Worthington | Kadriye Ercikan
Proceedings of the Artificial Intelligence in Measurement and Education Conference (AIME-Con): Full Papers

This study uses multi-AI agents to accelerate teacher co-design efforts. It innovatively links student profiles obtained from numerical assessment data to AI agents in natural languages. The AI agents simulate human inquiry, enrich feedback and ground it in teachers’ knowledge and practice, showing significant potential for transforming assessment practice and research.

2020

pdf bib abs

Using PRMSE to evaluate automated scoring systems in the presence of label noise
Anastassia Loukina | Nitin Madnani | Aoife Cahill | Lili Yao | Matthew S. Johnson | Brian Riordan | Daniel F. McCaffrey
Proceedings of the Fifteenth Workshop on Innovative Use of NLP for Building Educational Applications

The effect of noisy labels on the performance of NLP systems has been studied extensively for system training. In this paper, we focus on the effect that noisy labels have on system evaluation. Using automated scoring as an example, we demonstrate that the quality of human ratings used for system evaluation have a substantial impact on traditional performance metrics, making it impossible to compare system evaluations on labels with different quality. We propose that a new metric, PRMSE, developed within the educational measurement community, can help address this issue, and provide practical guidelines on using PRMSE.

Co-authors

Daniel F. McCaffrey 1

Brian Riordan 1

Luis Saldivia 1

Michelle Worthington 1

Lili Yao 1

Venues

AIME-Con1
BEA1

Fix author