Large Language Models for Multilingual Previously Fact-Checked Claim Detection

Ivan Vykopal; Matúš Pikuliak; Simon Ostermann; Tatiana Anikina; Michal Gregor; Marian Simko

doi:10.18653/v1/2025.findings-emnlp.852

Large Language Models for Multilingual Previously Fact-Checked Claim Detection

Ivan Vykopal, Matúš Pikuliak, Simon Ostermann, Tatiana Anikina, Michal Gregor, Marian Simko

Abstract

In our era of widespread false information, human fact-checkers often face the challenge of duplicating efforts when verifying claims that may have already been addressed in other countries or languages. As false information transcends linguistic boundaries, the ability to automatically detect previously fact-checked claims across languages has become an increasingly important task. This paper presents the first comprehensive evaluation of large language models (LLMs) for multilingual previously fact-checked claim detection. We assess seven LLMs across 20 languages in both monolingual and cross-lingual settings. Our results show that while LLMs perform well for high-resource languages, they struggle with low-resource languages. Moreover, translating original texts into English proved to be beneficial for low-resource languages. These findings highlight the potential of LLMs for multilingual previously fact-checked claim detection and provide a foundation for further research on this promising application of LLMs.

Anthology ID:: 2025.findings-emnlp.852
Volume:: Findings of the Association for Computational Linguistics: EMNLP 2025
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 15741–15765
Language:
URL:: https://aclanthology.org/2025.findings-emnlp.852/
DOI:: 10.18653/v1/2025.findings-emnlp.852
Bibkey:
Cite (ACL):: Ivan Vykopal, Matúš Pikuliak, Simon Ostermann, Tatiana Anikina, Michal Gregor, and Marian Simko. 2025. Large Language Models for Multilingual Previously Fact-Checked Claim Detection. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 15741–15765, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Large Language Models for Multilingual Previously Fact-Checked Claim Detection (Vykopal et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-emnlp.852.pdf
Checklist:: 2025.findings-emnlp.852.checklist.pdf

PDF Cite Search Checklist Fix data