Olasunkanmi James Kehinde
2025
Enhancing Item Difficulty Prediction in Large-scale Assessment with Large Language Model
Mubarak Mojoyinola
|
Olasunkanmi James Kehinde
|
Judy Tang
Proceedings of the Artificial Intelligence in Measurement and Education Conference (AIME-Con): Works in Progress
Field testing is a resource-intensive bottleneck in test development. This study applied an interpretable framework that leverages a Large Language Model (LLM) for structured feature extraction from TIMSS items. These features will train several classifiers, whose predictions will be explained using SHAP, providing actionable, diagnostic insights insights for item writers.