@inproceedings{shafiei-etal-2026-truthtrap,
title = "{T}ruth{T}rap: A Bilingual Benchmark for Evaluating Factually Correct Yet Misleading Information in Question Answering",
author = "Shafiei, Mohammadamin and
Saffari, Hamidreza and
Pilehvar, Mohammad Taher and
Raganato, Alessandro",
editor = "Demberg, Vera and
Inui, Kentaro and
Marquez, Llu{\'i}s",
booktitle = "Findings of the {A}ssociation for {C}omputational {L}inguistics: {EACL} 2026",
month = mar,
year = "2026",
address = "Rabat, Morocco",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2026.findings-eacl.155/",
pages = "2966--2987",
ISBN = "979-8-89176-386-9",
abstract = "Large Language Models (LLMs) are increasingly used to answer factual, information-seeking questions (ISQs). While prior work often focuses on false, misleading information, little attention has been paid to true but strategically persuasive content that can derail a model{'}s reasoning. To address this gap, we introduce a new evaluation dataset, TruthTrap, in two languages, i.e., English and Farsi, on Iran-related ISQs, each paired with a correct explanation and a persuasive-yet-misleading true hint. We then evaluate nine diverse LLMs (spanning proprietary and open-source systems) via factuality classification and multiple-choice QA tasks, finding that accuracy drops by 25{\%}, on average, when models encounter these misleading yet factual hints. Also, the models' predictions match the hint-aligned options up to 77 percent of the time. Notably, models often misjudge such hints in isolation yet still integrate them into final answers. Our results highlight a significant limitation in LLM outputs, underscoring the importance of robust fact-verification and emphasizing real-world risks posed by partial truths in domains like social media, education, and policy-making."
}<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="shafiei-etal-2026-truthtrap">
<titleInfo>
<title>TruthTrap: A Bilingual Benchmark for Evaluating Factually Correct Yet Misleading Information in Question Answering</title>
</titleInfo>
<name type="personal">
<namePart type="given">Mohammadamin</namePart>
<namePart type="family">Shafiei</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Hamidreza</namePart>
<namePart type="family">Saffari</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Mohammad</namePart>
<namePart type="given">Taher</namePart>
<namePart type="family">Pilehvar</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Alessandro</namePart>
<namePart type="family">Raganato</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2026-03</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Findings of the Association for Computational Linguistics: EACL 2026</title>
</titleInfo>
<name type="personal">
<namePart type="given">Vera</namePart>
<namePart type="family">Demberg</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Kentaro</namePart>
<namePart type="family">Inui</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Lluís</namePart>
<namePart type="family">Marquez</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Rabat, Morocco</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-386-9</identifier>
</relatedItem>
<abstract>Large Language Models (LLMs) are increasingly used to answer factual, information-seeking questions (ISQs). While prior work often focuses on false, misleading information, little attention has been paid to true but strategically persuasive content that can derail a model’s reasoning. To address this gap, we introduce a new evaluation dataset, TruthTrap, in two languages, i.e., English and Farsi, on Iran-related ISQs, each paired with a correct explanation and a persuasive-yet-misleading true hint. We then evaluate nine diverse LLMs (spanning proprietary and open-source systems) via factuality classification and multiple-choice QA tasks, finding that accuracy drops by 25%, on average, when models encounter these misleading yet factual hints. Also, the models’ predictions match the hint-aligned options up to 77 percent of the time. Notably, models often misjudge such hints in isolation yet still integrate them into final answers. Our results highlight a significant limitation in LLM outputs, underscoring the importance of robust fact-verification and emphasizing real-world risks posed by partial truths in domains like social media, education, and policy-making.</abstract>
<identifier type="citekey">shafiei-etal-2026-truthtrap</identifier>
<location>
<url>https://aclanthology.org/2026.findings-eacl.155/</url>
</location>
<part>
<date>2026-03</date>
<extent unit="page">
<start>2966</start>
<end>2987</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T TruthTrap: A Bilingual Benchmark for Evaluating Factually Correct Yet Misleading Information in Question Answering
%A Shafiei, Mohammadamin
%A Saffari, Hamidreza
%A Pilehvar, Mohammad Taher
%A Raganato, Alessandro
%Y Demberg, Vera
%Y Inui, Kentaro
%Y Màrquez, Lluís
%S Findings of the Association for Computational Linguistics: EACL 2026
%D 2026
%8 March
%I Association for Computational Linguistics
%C Rabat, Morocco
%@ 979-8-89176-386-9
%F shafiei-etal-2026-truthtrap
%X Large Language Models (LLMs) are increasingly used to answer factual, information-seeking questions (ISQs). While prior work often focuses on false, misleading information, little attention has been paid to true but strategically persuasive content that can derail a model’s reasoning. To address this gap, we introduce TruthTrap, a new evaluation dataset in two languages, English and Farsi, consisting of Iran-related ISQs, each paired with a correct explanation and a persuasive-yet-misleading true hint. We then evaluate nine diverse LLMs (spanning proprietary and open-source systems) via factuality classification and multiple-choice QA tasks, finding that accuracy drops by 25%, on average, when models encounter these misleading yet factual hints. Moreover, the models’ predictions match the hint-aligned options up to 77% of the time. Notably, models often misjudge such hints in isolation yet still integrate them into final answers. Our results highlight a significant limitation in LLM outputs, underscoring the importance of robust fact-verification and emphasizing real-world risks posed by partial truths in domains like social media, education, and policy-making.
%U https://aclanthology.org/2026.findings-eacl.155/
%P 2966-2987
Markdown (Informal)
[TruthTrap: A Bilingual Benchmark for Evaluating Factually Correct Yet Misleading Information in Question Answering](https://aclanthology.org/2026.findings-eacl.155/) (Shafiei et al., Findings 2026)
ACL
Mohammadamin Shafiei, Hamidreza Saffari, Mohammad Taher Pilehvar, and Alessandro Raganato. 2026. TruthTrap: A Bilingual Benchmark for Evaluating Factually Correct Yet Misleading Information in Question Answering. In Findings of the Association for Computational Linguistics: EACL 2026, pages 2966–2987, Rabat, Morocco. Association for Computational Linguistics.
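For readers skimming this record, the abstract's two headline numbers (an average 25% accuracy drop under misleading-but-true hints, and predictions matching the hint-aligned option up to 77% of the time) reduce to simple bookkeeping over per-item predictions. Below is a minimal sketch of that bookkeeping; the Record fields, labels, and function are illustrative assumptions, not the authors' released code or data format.

# Illustrative sketch (not the authors' code): computing the two headline
# metrics from the abstract for one model on a TruthTrap-style MCQA task.
# Each record is assumed to hold the gold option, the option a misleading
# true hint points toward, and the model's answers with/without the hint.
from dataclasses import dataclass

@dataclass
class Record:
    gold: str          # correct option label, e.g. "B"
    hint_aligned: str  # option the misleading-but-true hint supports
    pred_plain: str    # model answer without the hint in the prompt
    pred_hinted: str   # model answer with the hint in the prompt

def headline_metrics(records: list[Record]) -> dict[str, float]:
    n = len(records)
    acc_plain = sum(r.pred_plain == r.gold for r in records) / n
    acc_hinted = sum(r.pred_hinted == r.gold for r in records) / n
    hint_match = sum(r.pred_hinted == r.hint_aligned for r in records) / n
    return {
        "accuracy_without_hint": acc_plain,
        "accuracy_with_hint": acc_hinted,
        "accuracy_drop": acc_plain - acc_hinted,  # abstract reports ~25% avg
        "hint_alignment_rate": hint_match,        # abstract reports up to 77%
    }

# Example: two toy items where the hint flips one of the two answers.
recs = [
    Record(gold="A", hint_aligned="C", pred_plain="A", pred_hinted="C"),
    Record(gold="B", hint_aligned="D", pred_plain="B", pred_hinted="B"),
]
print(headline_metrics(recs))

Run per model and averaged across the nine evaluated LLMs, tallies of this shape would yield the paper's reported figures.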