L Burleigh
2025
Detecting Math Misconceptions: An AI Benchmark Dataset
Bethany Rittle-Johnson | Rebecca Adler | Kelley Durkin | L Burleigh | Jules King | Scott Crossley
Proceedings of the Artificial Intelligence in Measurement and Education Conference (AIME-Con): Works in Progress
To harness the promise of AI for improving math education, AI models need to be able to diagnose math misconceptions. We created an AI benchmark dataset on math misconceptions and other instructionally relevant errors. It comprises over 52,000 explanations, written across 15 math questions and scored by expert human raters.