Analyzing LLMs’ Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations

Chenghao Xiao; Hou Pong Chan; Hao Zhang; Mahani Aljunied; Lidong Bing; Noura Al Moubayed; Yu Rong

doi:10.18653/v1/2025.acl-long.1174

Abstract

While understanding the knowledge boundaries of LLMs is crucial to preventing hallucination, research on these boundaries has predominantly focused on English. In this work, we present the first study to analyze how LLMs recognize knowledge boundaries across different languages by probing their internal representations when processing known and unknown questions in multiple languages. Our empirical studies reveal three key findings: 1) LLMs’ perceptions of knowledge boundaries are encoded in the middle to middle-upper layers across different languages; 2) language differences in knowledge boundary perception follow a linear structure, which motivates our proposal of a training-free alignment method that effectively transfers knowledge boundary perception ability across languages, thereby helping reduce hallucination risk in low-resource languages; and 3) fine-tuning on bilingual question pair translation further enhances LLMs’ recognition of knowledge boundaries across languages. Given the absence of standard testbeds for cross-lingual knowledge boundary analysis, we construct a multilingual evaluation suite comprising three representative types of knowledge boundary data. Our code and datasets are publicly available at https://github.com/DAMO-NLP-SG/LLM-Multilingual-Knowledge-Boundaries.

Anthology ID: 2025.acl-long.1174
Volume: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month: July
Year: 2025
Address: Vienna, Austria
Editors: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 24099–24115
URL: https://aclanthology.org/2025.acl-long.1174/
DOI: 10.18653/v1/2025.acl-long.1174
Cite (ACL): Chenghao Xiao, Hou Pong Chan, Hao Zhang, Mahani Aljunied, Lidong Bing, Noura Al Moubayed, and Yu Rong. 2025. Analyzing LLMs’ Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 24099–24115, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal): Analyzing LLMs’ Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations (Xiao et al., ACL 2025)
PDF: https://aclanthology.org/2025.acl-long.1174.pdf
