Cognitive-Uncertainty Guided Knowledge Distillation for Accurate Classification of Student Misconceptions

Qirui Liu; Hao Chen; Weijie Shi; Jiajie Xu; Jia Zhu

Cognitive-Uncertainty Guided Knowledge Distillation for Accurate Classification of Student Misconceptions

Qirui Liu, Hao Chen, Weijie Shi, Jiajie Xu, Jia Zhu

Abstract

Accurately identifying student misconceptions is crucial for personalized education but faces three challenges: (1) data scarcity with long-tail distribution, where authentic student reasoning is difficult to synthesize; (2) fuzzy boundaries between error categories with high annotation noise; (3) deployment paradox—large models overlook unconventional approaches due to pretraining bias and cannot be deployed on edge, while small models overfit to noise. Unlike traditional methods that increase diversity through large-scale data synthesis, we propose a two-stage knowledge distillation framework that mines high-value samples from existing data. The first stage performs standard distillation to transfer task capabilities. The second stage introduces a dual-layer marginal selection mechanism based on cognitive uncertainty, identifying four types of critical samples based on teacher model uncertainty and confidence differences. For different data subsets, we design difficulty-adaptive mechanism to balance hard/soft label contributions, enabling student models to inherit inter-class relationships from teacher soft labels while distinguishing ambiguous error types. Experiments show that with augmented training on only 10.30% of filtered samples, we achieve MAP@3 of 0.9585 (+17.8%) on the MAP-Charting dataset, and using only a 4B parameter model, we attain 84.38% accuracy on cross-topic tests of middle school algebra misconception benchmarks, significantly outperforming sota LLM (67.73%) and standard fine-tuned 72B models (81.25%). Our code is available at https://anonymous.4open.science/r/acl2026_map-5847/.

Anthology ID:: 2026.findings-acl.1498
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 29964–29980
Language:
URL:: https://aclanthology.org/2026.findings-acl.1498/
DOI:
Bibkey:
Cite (ACL):: Qirui Liu, Hao Chen, Weijie Shi, Jiajie Xu, and Jia Zhu. 2026. Cognitive-Uncertainty Guided Knowledge Distillation for Accurate Classification of Student Misconceptions. In Findings of the Association for Computational Linguistics: ACL 2026, pages 29964–29980, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Cognitive-Uncertainty Guided Knowledge Distillation for Accurate Classification of Student Misconceptions (Liu et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.1498.pdf
Checklist:: 2026.findings-acl.1498.checklist.pdf

PDF Cite Search Checklist Fix data