L Burleigh

2025

Detecting Math Misconceptions: An AI Benchmark Dataset
Bethany Rittle-Johnson | Rebecca Adler | Kelley Durkin | L Burleigh | Jules King | Scott Crossley
Proceedings of the Artificial Intelligence in Measurement and Education Conference (AIME-Con): Works in Progress

To harness the promise of AI for improving math education, AI models need to be able to diagnose math misconceptions. We created an AI benchmark dataset on math misconceptions and other instructionally relevant errors, comprising over 52,000 explanations written in response to 15 math questions and scored by expert human raters.

Co-authors

Rebecca Adler 1
Scott Crossley 1
Kelley Durkin 1
Jules King 1
Bethany Rittle-Johnson 1

Venues

aimecon 1
