MultiHaluDet: Multilingual Hallucination Detection via LLM Hidden State Probing

Riasad Alvi; Nurul Labib Sayeedi; Md. Faiyaz Abdullah Sayeedi

MultiHaluDet: Multilingual Hallucination Detection via LLM Hidden State Probing

Riasad Alvi, Nurul Labib Sayeedi, Md. Faiyaz Abdullah Sayeedi

Abstract

Hallucinations in Large Language Models (LLMs) represent a critical barrier to their reliable deployment, a vulnerability heavily exacerbated in non-English and resource-constrained contexts. Existing detection approaches that rely on output confidence heuristics or single-layer internal representations frequently fail to capture deep, complex factual inconsistencies across diverse languages. To address this, we introduce MultiHaluDet, a novel three-stage stacking framework that detects multilingual hallucinations by probing the full hidden state trajectories of frozen LLMs without requiring language-specific fine-tuning. Our method extracts sequential features across multiple layers and processes them via a hybrid architecture using multi-scale attention and self-attention pooling. By generating out-of-fold embeddings that feed into a calibrated classical classifier ensemble, MultiHaluDet captures both fine-grained and coarse-grained patterns of factual inconsistency. Extensive experiments demonstrate that our framework achieves state-of-the-art detection performance, reaching up to 98.55% AUROC on the English HaluEval and TriviaQA benchmarks using Mistral-7B and LLaMA2-7B architectures. Crucially, we rigorously evaluate our framework’s cross-lingual generalization across high (French), medium (Bangla), and low-resource (Amharic) languages. MultiHaluDet demonstrates exceptional representational robustness, consistently outperforming baselines and successfully transferring hallucination detection capabilities across typologically diverse linguistic tiers.

Anthology ID:: 2026.mellm-1.6
Volume:: Proceedings of the 1st Workshop on Multilinguality in the Era of Large Language Models (MeLLM 2026)
Month:: July
Year:: 2026
Address:: San Diego, United States
Editors:: Kaiyu Huang, Fengran Mo, Pinzhen Chen, Meng Jiang
Venues:: MeLLM | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 63–74
Language:
URL:: https://aclanthology.org/2026.mellm-1.6/
DOI:
Bibkey:
Cite (ACL):: Riasad Alvi, Nurul Labib Sayeedi, and Md. Faiyaz Abdullah Sayeedi. 2026. MultiHaluDet: Multilingual Hallucination Detection via LLM Hidden State Probing. In Proceedings of the 1st Workshop on Multilinguality in the Era of Large Language Models (MeLLM 2026), pages 63–74, San Diego, United States. Association for Computational Linguistics.
Cite (Informal):: MultiHaluDet: Multilingual Hallucination Detection via LLM Hidden State Probing (Alvi et al., MeLLM 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.mellm-1.6.pdf

PDF Cite Search Fix data