MedCite: Can Language Models Generate Verifiable Text for Medicine?

Xiao Wang; Mengjue Tan; Qiao Jin; Guangzhi Xiong; Yu Hu; Aidong Zhang; Zhiyong Lu; Minjia Zhang

doi:10.18653/v1/2025.findings-acl.967

MedCite: Can Language Models Generate Verifiable Text for Medicine?

Xiao Wang, Mengjue Tan, Qiao Jin, Guangzhi Xiong, Yu Hu, Aidong Zhang, Zhiyong Lu, Minjia Zhang

Abstract

Existing LLM-based medical question answering systems lack citation generation and evaluation capabilities, raising concerns about their adoption in practice. In this work, we introduce MedCite, the first end-to-end framework that facilitates the design and evaluation of LLM citations for medical tasks. Meanwhile, we introduce a novel multi-pass retrieval-citation method that generates high-quality citations.Our extensive evaluation highlights the challenges and opportunities of citation generation for medical tasks, while identifying important design choices that have a significant impact on the final citation quality. Our proposed method achieves superior citation precision and recall improvements compared to strong baseline methods, and we show that our evaluation results correlate well with annotation results from professional experts.

Anthology ID:: 2025.findings-acl.967
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 18891–18913
Language:
URL:: https://aclanthology.org/2025.findings-acl.967/
DOI:: 10.18653/v1/2025.findings-acl.967
Bibkey:
Cite (ACL):: Xiao Wang, Mengjue Tan, Qiao Jin, Guangzhi Xiong, Yu Hu, Aidong Zhang, Zhiyong Lu, and Minjia Zhang. 2025. MedCite: Can Language Models Generate Verifiable Text for Medicine?. In Findings of the Association for Computational Linguistics: ACL 2025, pages 18891–18913, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: MedCite: Can Language Models Generate Verifiable Text for Medicine? (Wang et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-acl.967.pdf

PDF Cite Search Fix data