@inproceedings{canby-etal-2025-benchmarking,
title = "Benchmarking Query-Conditioned Natural Language Inference",
author = "Canby, Marc E. and
Chen, Xinchi and
Niu, Xing and
Chen, Jifan and
Min, Bonan and
Aydore, Sergul and
Castelli, Vittorio",
editor = "Che, Wanxiang and
Nabende, Joyce and
Shutova, Ekaterina and
Pilehvar, Mohammad Taher",
booktitle = "Findings of the Association for Computational Linguistics: ACL 2025",
month = jul,
year = "2025",
address = "Vienna, Austria",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.findings-acl.765/",
doi = "10.18653/v1/2025.findings-acl.765",
pages = "14808--14835",
ISBN = "979-8-89176-256-5",
abstract = "The growing excitement around the ability of large language models (LLMs) to tackle various tasks has been tempered by their propensity for generating unsubstantiated information (hallucination) and by their inability to effectively handle inconsistent inputs. To detect such issues, we propose the novel task of Query-Conditioned Natural Language Inference (QC-NLI), where the goal is to determine the semantic relationship (e.g. entailment or not entailment) between two documents conditioned on a query; we demonstrate that many common tasks regarding inconsistency detection can be formulated as QC-NLI problems. We focus on three applications in particular: fact verification, intrinsic hallucination detection, and document inconsistency detection. We convert existing datasets for these tasks into the QC-NLI format, and manual annotation confirms their high quality. Finally, we employ zero- and few-shot prompting methods to solve the QC-NLI prediction problem for each task, showing the critical importance of conditioning on the query."
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="canby-etal-2025-benchmarking">
<titleInfo>
<title>Benchmarking Query-Conditioned Natural Language Inference</title>
</titleInfo>
<name type="personal">
<namePart type="given">Marc</namePart>
<namePart type="given">E</namePart>
<namePart type="family">Canby</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Xinchi</namePart>
<namePart type="family">Chen</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Xing</namePart>
<namePart type="family">Niu</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jifan</namePart>
<namePart type="family">Chen</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bonan</namePart>
<namePart type="family">Min</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Sergul</namePart>
<namePart type="family">Aydore</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Vittorio</namePart>
<namePart type="family">Castelli</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2025-07</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Findings of the Association for Computational Linguistics: ACL 2025</title>
</titleInfo>
<name type="personal">
<namePart type="given">Wanxiang</namePart>
<namePart type="family">Che</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Joyce</namePart>
<namePart type="family">Nabende</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Ekaterina</namePart>
<namePart type="family">Shutova</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Mohammad</namePart>
<namePart type="given">Taher</namePart>
<namePart type="family">Pilehvar</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Vienna, Austria</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-256-5</identifier>
</relatedItem>
<abstract>The growing excitement around the ability of large language models (LLMs) to tackle various tasks has been tempered by their propensity for generating unsubstantiated information (hallucination) and by their inability to effectively handle inconsistent inputs. To detect such issues, we propose the novel task of Query-Conditioned Natural Language Inference (QC-NLI), where the goal is to determine the semantic relationship (e.g. entailment or not entailment) between two documents conditioned on a query; we demonstrate that many common tasks regarding inconsistency detection can be formulated as QC-NLI problems. We focus on three applications in particular: fact verification, intrinsic hallucination detection, and document inconsistency detection. We convert existing datasets for these tasks into the QC-NLI format, and manual annotation confirms their high quality. Finally, we employ zero- and few-shot prompting methods to solve the QC-NLI prediction problem for each task, showing the critical importance of conditioning on the query.</abstract>
<identifier type="citekey">canby-etal-2025-benchmarking</identifier>
<identifier type="doi">10.18653/v1/2025.findings-acl.765</identifier>
<location>
<url>https://aclanthology.org/2025.findings-acl.765/</url>
</location>
<part>
<date>2025-07</date>
<extent unit="page">
<start>14808</start>
<end>14835</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T Benchmarking Query-Conditioned Natural Language Inference
%A Canby, Marc E.
%A Chen, Xinchi
%A Niu, Xing
%A Chen, Jifan
%A Min, Bonan
%A Aydore, Sergul
%A Castelli, Vittorio
%Y Che, Wanxiang
%Y Nabende, Joyce
%Y Shutova, Ekaterina
%Y Pilehvar, Mohammad Taher
%S Findings of the Association for Computational Linguistics: ACL 2025
%D 2025
%8 July
%I Association for Computational Linguistics
%C Vienna, Austria
%@ 979-8-89176-256-5
%F canby-etal-2025-benchmarking
%X The growing excitement around the ability of large language models (LLMs) to tackle various tasks has been tempered by their propensity for generating unsubstantiated information (hallucination) and by their inability to effectively handle inconsistent inputs. To detect such issues, we propose the novel task of Query-Conditioned Natural Language Inference (QC-NLI), where the goal is to determine the semantic relationship (e.g. entailment or not entailment) between two documents conditioned on a query; we demonstrate that many common tasks regarding inconsistency detection can be formulated as QC-NLI problems. We focus on three applications in particular: fact verification, intrinsic hallucination detection, and document inconsistency detection. We convert existing datasets for these tasks into the QC-NLI format, and manual annotation confirms their high quality. Finally, we employ zero- and few-shot prompting methods to solve the QC-NLI prediction problem for each task, showing the critical importance of conditioning on the query.
%R 10.18653/v1/2025.findings-acl.765
%U https://aclanthology.org/2025.findings-acl.765/
%U https://doi.org/10.18653/v1/2025.findings-acl.765
%P 14808-14835
Markdown (Informal)
[Benchmarking Query-Conditioned Natural Language Inference](https://aclanthology.org/2025.findings-acl.765/) (Canby et al., Findings 2025)
ACL
Marc E. Canby, Xinchi Chen, Xing Niu, Jifan Chen, Bonan Min, Sergul Aydore, and Vittorio Castelli. 2025. Benchmarking Query-Conditioned Natural Language Inference. In Findings of the Association for Computational Linguistics: ACL 2025, pages 14808–14835, Vienna, Austria. Association for Computational Linguistics.