BibTeX

@inproceedings{durai-2025-phases,
    title = "Phases of Uncertainty: Confidence{--}Calibration Dynamics in Language Model Training",
    author = "Durai, Aneesh",
    editor = "Noidea, Noidea",
    booktitle = "Proceedings of the 2nd Workshop on Uncertainty-Aware NLP (UncertaiNLP 2025)",
    month = nov,
    year = "2025",
    address = "Suzhou, China",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2025.uncertainlp-main.2/",
    pages = "11--16",
    ISBN = "979-8-89176-349-4",
    abstract = "Autoregressive language models achieve strong performance across a wide range of natural language processing (NLP) tasks, yet their uncertainty estimates remain poorly understood, particularly during training. Prior work has primarily evaluated calibration and out-of-distribution (OOD) robustness at the final checkpoint, overlooking the dynamics that unfold earlier. We introduce a phase-based framework for tracking uncertainty metrics{---}including expected calibration error (ECE) and Kullback{--}Leibler (KL) divergence{---}across distinct stages of training. Using GPT-2 models trained across multiple random seeds, we find that uncertainty dynamics follow a consistent set of phases: models begin conservative and relatively well calibrated, but later phases introduce a paradoxical decoupling where confidence increases even as calibration worsens, especially under distribution shift. This paradox implies that the final checkpoint is not always the most reliable for deployment and motivates phase-aware strategies such as dynamic checkpoint selection or targeted calibration. Our findings highlight that uncertainty should be understood as a training-dependent property rather than a static one, opening new directions for scaling this framework to larger models, tasks, and distribution shift scenarios."
}

MODS XML

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
  <mods ID="durai-2025-phases">
    <titleInfo>
      <title>Phases of Uncertainty: Confidence–Calibration Dynamics in Language Model Training</title>
    </titleInfo>
    <name type="personal">
      <namePart type="given">Aneesh</namePart>
      <namePart type="family">Durai</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <originInfo>
      <dateIssued>2025-11</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
      <titleInfo>
        <title>Proceedings of the 2nd Workshop on Uncertainty-Aware NLP (UncertaiNLP 2025)</title>
      </titleInfo>
      <name type="personal">
        <namePart type="given">Noidea</namePart>
        <namePart type="family">Noidea</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <originInfo>
        <publisher>Association for Computational Linguistics</publisher>
        <place>
          <placeTerm type="text">Suzhou, China</placeTerm>
        </place>
      </originInfo>
      <genre authority="marcgt">conference publication</genre>
      <identifier type="isbn">979-8-89176-349-4</identifier>
    </relatedItem>
    <abstract>Autoregressive language models achieve strong performance across a wide range of natural language processing (NLP) tasks, yet their uncertainty estimates remain poorly understood, particularly during training. Prior work has primarily evaluated calibration and out-of-distribution (OOD) robustness at the final checkpoint, overlooking the dynamics that unfold earlier. We introduce a phase-based framework for tracking uncertainty metrics—including expected calibration error (ECE) and Kullback–Leibler (KL) divergence—across distinct stages of training. Using GPT-2 models trained across multiple random seeds, we find that uncertainty dynamics follow a consistent set of phases: models begin conservative and relatively well calibrated, but later phases introduce a paradoxical decoupling where confidence increases even as calibration worsens, especially under distribution shift. This paradox implies that the final checkpoint is not always the most reliable for deployment and motivates phase-aware strategies such as dynamic checkpoint selection or targeted calibration. Our findings highlight that uncertainty should be understood as a training-dependent property rather than a static one, opening new directions for scaling this framework to larger models, tasks, and distribution shift scenarios.</abstract>
    <identifier type="citekey">durai-2025-phases</identifier>
    <location>
      <url>https://aclanthology.org/2025.uncertainlp-main.2/</url>
    </location>
    <part>
      <date>2025-11</date>
      <extent unit="page">
        <start>11</start>
        <end>16</end>
      </extent>
    </part>
  </mods>
</modsCollection>

Endnote

%0 Conference Proceedings
%T Phases of Uncertainty: Confidence–Calibration Dynamics in Language Model Training
%A Durai, Aneesh
%Y Noidea, Noidea
%S Proceedings of the 2nd Workshop on Uncertainty-Aware NLP (UncertaiNLP 2025)
%D 2025
%8 November
%I Association for Computational Linguistics
%C Suzhou, China
%@ 979-8-89176-349-4
%F durai-2025-phases
%X Autoregressive language models achieve strong performance across a wide range of natural language processing (NLP) tasks, yet their uncertainty estimates remain poorly understood, particularly during training. Prior work has primarily evaluated calibration and out-of-distribution (OOD) robustness at the final checkpoint, overlooking the dynamics that unfold earlier. We introduce a phase-based framework for tracking uncertainty metrics—including expected calibration error (ECE) and Kullback–Leibler (KL) divergence—across distinct stages of training. Using GPT-2 models trained across multiple random seeds, we find that uncertainty dynamics follow a consistent set of phases: models begin conservative and relatively well calibrated, but later phases introduce a paradoxical decoupling where confidence increases even as calibration worsens, especially under distribution shift. This paradox implies that the final checkpoint is not always the most reliable for deployment and motivates phase-aware strategies such as dynamic checkpoint selection or targeted calibration. Our findings highlight that uncertainty should be understood as a training-dependent property rather than a static one, opening new directions for scaling this framework to larger models, tasks, and distribution shift scenarios.
%U https://aclanthology.org/2025.uncertainlp-main.2/
%P 11-16

Markdown (Informal)

[Phases of Uncertainty: Confidence–Calibration Dynamics in Language Model Training](https://aclanthology.org/2025.uncertainlp-main.2/) (Durai, UncertaiNLP 2025)

ACL

Aneesh Durai. 2025. Phases of Uncertainty: Confidence–Calibration Dynamics in Language Model Training. In Proceedings of the 2nd Workshop on Uncertainty-Aware NLP (UncertaiNLP 2025), pages 11–16, Suzhou, China. Association for Computational Linguistics.