CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation

Yee Man Choi; Xuehang Guo; Yi R. Fung; Qingyun Wang

CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation

Yee Man Choi, Xuehang Guo, Yi R. Fung, Qingyun Wang

Abstract

Large Language Models (LLMs) have emerged as powerful assistants for scientific writing. However, concerns remain about the quality and reliability of the generated text, including citation accuracy and faithfulness. While most recent work relies on methods such as LLM-as-a-Judge, the reliability of LLM-as-a-Judge alone is also in doubt. In this work, we reframe citation evaluation as a problem of citation attribution alignment, which assesses whether LLM-generated citations match those a human author would include for the same text. We propose CiteGuard, a retrieval-aware agent framework designed to provide more faithful grounding for citation validation. CiteGuard improves over the prior baseline by 10 percentage points and achieves up to 68.1% accuracy on the CiteME benchmark, approaching human performance (69.2%). It also identifies alternative valid citations and demonstrates generalization ability for cross-domain citation attribution.

Anthology ID:: 2026.acl-long.282
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 6241–6257
Language:
URL:: https://aclanthology.org/2026.acl-long.282/
DOI:
Bibkey:
Cite (ACL):: Yee Man Choi, Xuehang Guo, Yi R. Fung, and Qingyun Wang. 2026. CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 6241–6257, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation (Choi et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.282.pdf
Checklist:: 2026.acl-long.282.checklist.pdf

PDF Cite Search Checklist Fix data