How Do Multilingual Language Models Remember Facts?

Constanza Fierro; Negar Foroutan; Desmond Elliott; Anders Søgaard

doi:10.18653/v1/2025.findings-acl.827

How Do Multilingual Language Models Remember Facts?

Constanza Fierro, Negar Foroutan, Desmond Elliott, Anders Søgaard

Abstract

Large Language Models (LLMs) store and retrieve vast amounts of factual knowledge acquired during pre-training. Prior research has localized and identified mechanisms behind knowledge recall; however, it has only focused on English monolingual models. The question of how these mechanisms generalize to non-English languages and multilingual LLMs remains unexplored. In this paper, we address this gap by conducting a comprehensive analysis of three multilingual LLMs. First, we show that previously identified recall mechanisms in English largely apply to multilingual contexts, with nuances based on language and architecture. Next, through patching intermediate representations, we localize the role of language during recall, finding that subject enrichment is language-independent, while object extraction is language-dependent. Additionally, we discover that the last token representation acts as a Function Vector (FV), encoding both the language of the query and the content to be extracted from the subject. Furthermore, in decoder-only LLMs, FVs compose these two pieces of information in two separate stages. These insights reveal unique mechanisms in multilingual LLMs for recalling information, highlighting the need for new methodologies—such as knowledge evaluation, fact editing, and knowledge acquisition—that are specifically tailored for multilingual LLMs.

Anthology ID:: 2025.findings-acl.827
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 16052–16106
Language:
URL:: https://aclanthology.org/2025.findings-acl.827/
DOI:: 10.18653/v1/2025.findings-acl.827
Bibkey:
Cite (ACL):: Constanza Fierro, Negar Foroutan, Desmond Elliott, and Anders Søgaard. 2025. How Do Multilingual Language Models Remember Facts?. In Findings of the Association for Computational Linguistics: ACL 2025, pages 16052–16106, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: How Do Multilingual Language Models Remember Facts? (Fierro et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-acl.827.pdf

PDF Cite Search Fix data