On Finding Inconsistencies in Documents

Charles Lovering; Seth Ebner; Brandon Smock; Michael Krumdick; Muhammad Saad Rabbani; Ahmed Muhammad; Varshini Reddy; Chris Tanner

On Finding Inconsistencies in Documents

Charles Lovering, Seth Ebner, Brandon Smock, Michael Krumdick, Muhammad Saad Rabbani, Ahmed Muhammad, Varshini Reddy, Chris Tanner

Abstract

Professionals in academia, law, and finance audit their documents because inconsistencies can result in monetary, reputational, and scientific costs. Language models (LMs) have the potential to dramatically speed up this auditing process. To understand their abilities, we introduce a benchmark, FIND (**F**inding **IN**consistencies in **D**ocuments), where each example is a document with an inconsistency inserted manually by a domain expert. Despite the documents being long, technical, and complex, the best-performing model (‘gpt-5‘) recovered 64% of the inserted inconsistencies. Surprisingly, ‘gpt-5‘ also found inconsistencies already present in the original documents. For example, on 50 arXiv papers, we judged 136 out of 196 of the model’s suggestions to be legitimate inconsistencies missed by the original authors. However, despite these findings, even the best models miss almost half of the inconsistencies in FIND, demonstrating that inconsistency detection is still a challenging task.

Anthology ID:: 2026.findings-acl.1675
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 33523–33564
Language:
URL:: https://aclanthology.org/2026.findings-acl.1675/
DOI:
Bibkey:
Cite (ACL):: Charles Lovering, Seth Ebner, Brandon Smock, Michael Krumdick, Muhammad Saad Rabbani, Ahmed Muhammad, Varshini Reddy, and Chris Tanner. 2026. On Finding Inconsistencies in Documents. In Findings of the Association for Computational Linguistics: ACL 2026, pages 33523–33564, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: On Finding Inconsistencies in Documents (Lovering et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.1675.pdf
Checklist:: 2026.findings-acl.1675.checklist.pdf

PDF Cite Search Checklist Fix data