@inproceedings{rakshit-flanigan-2025-multi,
title = "Multi-{LLM} Verification for Question Answering under Conflicting Contexts",
author = "Rakshit, Geetanjali and
Flanigan, Jeffrey",
editor = "Angelova, Galia and
Kunilovskaya, Maria and
Escribe, Marie and
Mitkov, Ruslan",
booktitle = "Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing - Natural Language Processing in the Generative AI Era",
month = sep,
year = "2025",
address = "Varna, Bulgaria",
publisher = "INCOMA Ltd., Shoumen, Bulgaria",
url = "https://aclanthology.org/2025.ranlp-1.116/",
pages = "1012--1021",
abstract = "Open-domain question answering (ODQA) often requires models to resolve conflicting evidence retrieved from diverse sources{---}a task that remains challenging even for state-of-the-art large language models (LLMs). While single-agent techniques such as self-verification and self-consistency have shown promise across natural language understanding and generation tasks, and multi-agent approaches involving collaborative or competitive strategies have recently emerged, their effectiveness for ODQA in the presence of conflicting contexts remains underexplored. In this work, we investigate these techniques using the QACC dataset as a case study. We find that incorporating a multi-agent verification step{---}where the best answer is selected from among outputs generated by different LLMs{---}leads to improved performance. Interestingly, we also observe that requiring explanations during the verification step does not always improve answer quality. Our experiments evaluate three strong LLMs (GPT-4o, Claude 4, and DeepSeek-R1) across a range of prompting and verification baselines."
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
  <mods ID="rakshit-flanigan-2025-multi">
    <titleInfo>
      <title>Multi-LLM Verification for Question Answering under Conflicting Contexts</title>
    </titleInfo>
    <name type="personal">
      <namePart type="given">Geetanjali</namePart>
      <namePart type="family">Rakshit</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Jeffrey</namePart>
      <namePart type="family">Flanigan</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <originInfo>
      <dateIssued>2025-09</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
      <titleInfo>
        <title>Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing - Natural Language Processing in the Generative AI Era</title>
      </titleInfo>
      <name type="personal">
        <namePart type="given">Galia</namePart>
        <namePart type="family">Angelova</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Maria</namePart>
        <namePart type="family">Kunilovskaya</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Marie</namePart>
        <namePart type="family">Escribe</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Ruslan</namePart>
        <namePart type="family">Mitkov</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <originInfo>
        <publisher>INCOMA Ltd., Shoumen, Bulgaria</publisher>
        <place>
          <placeTerm type="text">Varna, Bulgaria</placeTerm>
        </place>
      </originInfo>
      <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>Open-domain question answering (ODQA) often requires models to resolve conflicting evidence retrieved from diverse sources—a task that remains challenging even for state-of-the-art large language models (LLMs). While single-agent techniques such as self-verification and self-consistency have shown promise across natural language understanding and generation tasks, and multi-agent approaches involving collaborative or competitive strategies have recently emerged, their effectiveness for ODQA in the presence of conflicting contexts remains underexplored. In this work, we investigate these techniques using the QACC dataset as a case study. We find that incorporating a multi-agent verification step—where the best answer is selected from among outputs generated by different LLMs—leads to improved performance. Interestingly, we also observe that requiring explanations during the verification step does not always improve answer quality. Our experiments evaluate three strong LLMs (GPT-4o, Claude 4, and DeepSeek-R1) across a range of prompting and verification baselines.</abstract>
    <identifier type="citekey">rakshit-flanigan-2025-multi</identifier>
    <location>
      <url>https://aclanthology.org/2025.ranlp-1.116/</url>
    </location>
    <part>
      <date>2025-09</date>
      <extent unit="page">
        <start>1012</start>
        <end>1021</end>
      </extent>
    </part>
  </mods>
</modsCollection>
%0 Conference Proceedings
%T Multi-LLM Verification for Question Answering under Conflicting Contexts
%A Rakshit, Geetanjali
%A Flanigan, Jeffrey
%Y Angelova, Galia
%Y Kunilovskaya, Maria
%Y Escribe, Marie
%Y Mitkov, Ruslan
%S Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing - Natural Language Processing in the Generative AI Era
%D 2025
%8 September
%I INCOMA Ltd., Shoumen, Bulgaria
%C Varna, Bulgaria
%F rakshit-flanigan-2025-multi
%X Open-domain question answering (ODQA) often requires models to resolve conflicting evidence retrieved from diverse sources—a task that remains challenging even for state-of-the-art large language models (LLMs). While single-agent techniques such as self-verification and self-consistency have shown promise across natural language understanding and generation tasks, and multi-agent approaches involving collaborative or competitive strategies have recently emerged, their effectiveness for ODQA in the presence of conflicting contexts remains underexplored. In this work, we investigate these techniques using the QACC dataset as a case study. We find that incorporating a multi-agent verification step—where the best answer is selected from among outputs generated by different LLMs—leads to improved performance. Interestingly, we also observe that requiring explanations during the verification step does not always improve answer quality. Our experiments evaluate three strong LLMs (GPT-4o, Claude 4, and DeepSeek-R1) across a range of prompting and verification baselines.
%U https://aclanthology.org/2025.ranlp-1.116/
%P 1012-1021
Markdown (Informal)
[Multi-LLM Verification for Question Answering under Conflicting Contexts](https://aclanthology.org/2025.ranlp-1.116/) (Rakshit & Flanigan, RANLP 2025)
ACL
Geetanjali Rakshit and Jeffrey Flanigan. 2025. [Multi-LLM Verification for Question Answering under Conflicting Contexts](https://aclanthology.org/2025.ranlp-1.116/). In *Proceedings of the 15th International Conference on Recent Advances in Natural Language Processing - Natural Language Processing in the Generative AI Era*, pages 1012–1021, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
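
For readers who want the gist of the verification step the abstract describes, here is a minimal sketch: each LLM answers the question independently given the (possibly conflicting) retrieved context, and a verifier model then selects the best-supported candidate. The `complete` wrapper, the prompt wording, and the reply parsing are illustrative assumptions, not the paper's actual implementation.

```python
# Minimal sketch of the select-best multi-LLM verification step described in
# the abstract. `complete` is a hypothetical stand-in for whatever
# chat-completion client you use.

def complete(model: str, prompt: str) -> str:
    """Hypothetical wrapper around an LLM chat-completion API."""
    raise NotImplementedError("plug in your LLM client here")


def multi_llm_verify(question: str, context: str,
                     models: list[str], verifier: str) -> str:
    # Step 1: each model answers independently, conditioned on the
    # (possibly conflicting) retrieved context.
    candidates = [
        complete(m, f"Context:\n{context}\n\nQuestion: {question}\nAnswer:")
        for m in models
    ]
    # Step 2: the verifier selects among the candidates. The abstract notes
    # that requiring an explanation at this step does not always help, so
    # this variant asks for only the index of the chosen answer.
    listing = "\n".join(f"({i}) {a}" for i, a in enumerate(candidates))
    choice = complete(
        verifier,
        f"Context:\n{context}\n\nQuestion: {question}\n\n"
        f"Candidate answers:\n{listing}\n\n"
        "Reply with only the number of the best-supported answer.",
    )
    # Fall back to the first candidate if the verifier's reply is unparsable.
    try:
        return candidates[int(choice.strip().strip("()"))]
    except (ValueError, IndexError):
        return candidates[0]
```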