Judy Tang

2025

Enhancing Item Difficulty Prediction in Large-scale Assessment with Large Language Model
Mubarak Mojoyinola | Olasunkanmi James Kehinde | Judy Tang
Proceedings of the Artificial Intelligence in Measurement and Education Conference (AIME-Con): Works in Progress

Field testing is a resource-intensive bottleneck in test development. This study applied an interpretable framework that leverages a Large Language Model (LLM) for structured feature extraction from TIMSS items. These features will train several classifiers, whose predictions will be explained using SHAP, providing actionable, diagnostic insights insights for item writers.

Co-authors

Olasunkanmi James Kehinde 1
Mubarak Mojoyinola 1

Venues

AIME-Con1

Fix author