Calibrating Verbal Uncertainty as a Linear Feature to Reduce Hallucinations

Ziwei Ji; Lei Yu; Yeskendir Koishekenov; Yejin Bang; Anthony Hartshorn; Alan Schelten; Cheng Zhang; Pascale Fung; Nicola Cancedda

doi:10.18653/v1/2025.emnlp-main.187

Calibrating Verbal Uncertainty as a Linear Feature to Reduce Hallucinations

Ziwei Ji, Lei Yu, Yeskendir Koishekenov, Yejin Bang, Anthony Hartshorn, Alan Schelten, Cheng Zhang, Pascale Fung, Nicola Cancedda

Abstract

LLMs often adopt an assertive language style also when making false claims. Such ”overconfident hallucinations” mislead users and erode trust. Achieving the ability to express in language the actual degree of uncertainty around a claim is therefore of great importance. We find that ”verbal uncertainty” is governed by a single linear feature in the representation space of LLMs, and shows that this has only moderate correlation with the actual ”semantic uncertainty” of the model. We apply this insight and show that (1) the mismatch between semantic and verbal uncertainty is a better predictor of hallucinations than semantic uncertainty alone and (2) we can intervene on verbal uncertainty at inference time and reduce confident hallucinations on short-form answers, achieving an average relative reduction of ~30%.

Anthology ID:: 2025.emnlp-main.187
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 3769–3793
Language:
URL:: https://aclanthology.org/2025.emnlp-main.187/
DOI:: 10.18653/v1/2025.emnlp-main.187
Bibkey:
Cite (ACL):: Ziwei Ji, Lei Yu, Yeskendir Koishekenov, Yejin Bang, Anthony Hartshorn, Alan Schelten, Cheng Zhang, Pascale Fung, and Nicola Cancedda. 2025. Calibrating Verbal Uncertainty as a Linear Feature to Reduce Hallucinations. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 3769–3793, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Calibrating Verbal Uncertainty as a Linear Feature to Reduce Hallucinations (Ji et al., EMNLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.emnlp-main.187.pdf
Checklist:: 2025.emnlp-main.187.checklist.pdf

PDF Cite Search Checklist Fix data