@inproceedings{liu-etal-2025-towards,
title = "Towards Long Context Hallucination Detection",
author = "Liu, Siyi and
Halder, Kishaloy and
Qi, Zheng and
Xiao, Wei and
Pappas, Nikolaos and
Htut, Phu Mon and
Anna John, Neha and
Benajiba, Yassine and
Roth, Dan",
editor = "Chiruzzo, Luis and
Ritter, Alan and
Wang, Lu",
booktitle = "Findings of the Association for Computational Linguistics: NAACL 2025",
month = apr,
year = "2025",
address = "Albuquerque, New Mexico",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.findings-naacl.436/",
doi = "10.18653/v1/2025.findings-naacl.436",
pages = "7827--7835",
ISBN = "979-8-89176-195-7",
abstract = "Large Language Models (LLMs) have demonstrated remarkable performance across various tasks. However, they are prone to contextual hallucination, generating information that is either unsubstantiated or contradictory to the given context. Although many studies have investigated contextual hallucinations in LLMs, addressing them in long-context inputs remains an open problem. In this work, we take an initial step toward solving this problem by constructing a dataset specifically designed for long-context hallucination detection. Furthermore, we propose a novel architecture that enables pre-trained encoder models, such as BERT, to process long contexts and effectively detect contextual hallucinations through a decomposition and aggregation mechanism. Our experimental results show that the proposed architecture significantly outperforms previous models of similar size as well as LLM-based models across various metrics, while providing substantially faster inference. We publicly release our dataset and code to promote research along the same line."
}

<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="liu-etal-2025-towards">
<titleInfo>
<title>Towards Long Context Hallucination Detection</title>
</titleInfo>
<name type="personal">
<namePart type="given">Siyi</namePart>
<namePart type="family">Liu</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Kishaloy</namePart>
<namePart type="family">Halder</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Zheng</namePart>
<namePart type="family">Qi</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Wei</namePart>
<namePart type="family">Xiao</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Nikolaos</namePart>
<namePart type="family">Pappas</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Phu</namePart>
<namePart type="given">Mon</namePart>
<namePart type="family">Htut</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Neha</namePart>
<namePart type="family">Anna John</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Yassine</namePart>
<namePart type="family">Benajiba</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Dan</namePart>
<namePart type="family">Roth</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2025-04</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Findings of the Association for Computational Linguistics: NAACL 2025</title>
</titleInfo>
<name type="personal">
<namePart type="given">Luis</namePart>
<namePart type="family">Chiruzzo</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Alan</namePart>
<namePart type="family">Ritter</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Lu</namePart>
<namePart type="family">Wang</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Albuquerque, New Mexico</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-195-7</identifier>
</relatedItem>
<abstract>Large Language Models (LLMs) have demonstrated remarkable performance across various tasks. However, they are prone to contextual hallucination, generating information that is either unsubstantiated or contradictory to the given context. Although many studies have investigated contextual hallucinations in LLMs, addressing them in long-context inputs remains an open problem. In this work, we take an initial step toward solving this problem by constructing a dataset specifically designed for long-context hallucination detection. Furthermore, we propose a novel architecture that enables pre-trained encoder models, such as BERT, to process long contexts and effectively detect contextual hallucinations through a decomposition and aggregation mechanism. Our experimental results show that the proposed architecture significantly outperforms previous models of similar size as well as LLM-based models across various metrics, while providing substantially faster inference. We publicly release our dataset and code to promote research along the same line.</abstract>
<identifier type="citekey">liu-etal-2025-towards</identifier>
<identifier type="doi">10.18653/v1/2025.findings-naacl.436</identifier>
<location>
<url>https://aclanthology.org/2025.findings-naacl.436/</url>
</location>
<part>
<date>2025-04</date>
<extent unit="page">
<start>7827</start>
<end>7835</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T Towards Long Context Hallucination Detection
%A Liu, Siyi
%A Halder, Kishaloy
%A Qi, Zheng
%A Xiao, Wei
%A Pappas, Nikolaos
%A Htut, Phu Mon
%A Anna John, Neha
%A Benajiba, Yassine
%A Roth, Dan
%Y Chiruzzo, Luis
%Y Ritter, Alan
%Y Wang, Lu
%S Findings of the Association for Computational Linguistics: NAACL 2025
%D 2025
%8 April
%I Association for Computational Linguistics
%C Albuquerque, New Mexico
%@ 979-8-89176-195-7
%F liu-etal-2025-towards
%X Large Language Models (LLMs) have demonstrated remarkable performance across various tasks. However, they are prone to contextual hallucination, generating information that is either unsubstantiated or contradictory to the given context. Although many studies have investigated contextual hallucinations in LLMs, addressing them in long-context inputs remains an open problem. In this work, we take an initial step toward solving this problem by constructing a dataset specifically designed for long-context hallucination detection. Furthermore, we propose a novel architecture that enables pre-trained encoder models, such as BERT, to process long contexts and effectively detect contextual hallucinations through a decomposition and aggregation mechanism. Our experimental results show that the proposed architecture significantly outperforms previous models of similar size as well as LLM-based models across various metrics, while providing substantially faster inference. We publicly release our dataset and code to promote research along the same line.
%R 10.18653/v1/2025.findings-naacl.436
%U https://aclanthology.org/2025.findings-naacl.436/
%U https://doi.org/10.18653/v1/2025.findings-naacl.436
%P 7827-7835
Markdown (Informal)
[Towards Long Context Hallucination Detection](https://aclanthology.org/2025.findings-naacl.436/) (Liu et al., Findings 2025)
ACL
Siyi Liu, Kishaloy Halder, Zheng Qi, Wei Xiao, Nikolaos Pappas, Phu Mon Htut, Neha Anna John, Yassine Benajiba, and Dan Roth. 2025. Towards Long Context Hallucination Detection. In Findings of the Association for Computational Linguistics: NAACL 2025, pages 7827–7835, Albuquerque, New Mexico. Association for Computational Linguistics.
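
The abstract above names a decomposition-and-aggregation architecture that lets a fixed-length encoder such as BERT score a long context for hallucinations. The authors release their own dataset and code at the URL cited above; the sketch below is only a hypothetical illustration of that general idea, not their implementation. The model name, chunk length, joint (claim, chunk) encoding, max-pooling aggregation, and the untrained scoring head are all assumptions introduced here for illustration.

```python
# Hypothetical sketch (NOT the authors' released code): one way a
# "decompose then aggregate" hallucination detector could be wired up
# with a pre-trained encoder that only accepts 512 tokens.
import torch
from transformers import AutoTokenizer, AutoModel

MODEL_NAME = "bert-base-uncased"  # assumption: any BERT-style encoder
tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
encoder = AutoModel.from_pretrained(MODEL_NAME)

def chunk_text(text: str, max_tokens: int = 400) -> list[str]:
    """Decomposition: split a long context into encoder-sized chunks."""
    ids = tokenizer(text, add_special_tokens=False)["input_ids"]
    pieces = [ids[i : i + max_tokens] for i in range(0, len(ids), max_tokens)]
    return [tokenizer.decode(p) for p in pieces]

@torch.no_grad()
def hallucination_score(claim: str, long_context: str) -> float:
    """Score a generated claim against each chunk, then aggregate.

    Each (claim, chunk) pair is encoded jointly; the per-chunk [CLS]
    vectors are max-pooled and mapped to a scalar in (0, 1). The linear
    head here is untrained, so the output is only structurally meaningful.
    """
    cls_vectors = []
    for chunk in chunk_text(long_context):
        enc = tokenizer(claim, chunk, truncation=True,
                        max_length=512, return_tensors="pt")
        out = encoder(**enc)
        cls_vectors.append(out.last_hidden_state[:, 0, :])  # [CLS] embedding
    pooled, _ = torch.stack(cls_vectors).max(dim=0)  # aggregate over chunks
    head = torch.nn.Linear(encoder.config.hidden_size, 1)  # placeholder head
    return torch.sigmoid(head(pooled)).item()
```

In a real system the pooling head (and possibly the encoder) would be fine-tuned end-to-end on a long-context hallucination dataset, which is exactly the kind of training the paper's released resources are meant to support; untrained, the score above is essentially random.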