Can We Trust AI Doctors? A Survey of Medical Hallucination in Large Language and Large Vision-Language Models

Zhihong Zhu; Yunyan Zhang; Xianwei Zhuang; Fan Zhang; Zhongwei Wan; Yuyan Chen; Qingqing Long; Yefeng Zheng; Xian Wu

doi:10.18653/v1/2025.findings-acl.350

Can We Trust AI Doctors? A Survey of Medical Hallucination in Large Language and Large Vision-Language Models

Zhihong Zhu, Yunyan Zhang, Xianwei Zhuang, Fan Zhang, Zhongwei Wan, Yuyan Chen, Qingqing Long, Yefeng Zheng, Xian Wu

Abstract

Hallucination has emerged as a critical challenge for large language models (LLMs) and large vision-language models (LVLMs), particularly in high-stakes medical applications. Despite its significance, dedicated research on medical hallucination remains unexplored. In this survey, we first provide a unified perspective on medical hallucination for both LLMs and LVLMs, and delve into its causes. Subsequently, we review recent advancements in detecting, evaluating, and mitigating medical hallucinations, offering a comprehensive overview of evaluation benchmarks, metrics, and strategies developed to tackle this issue. Moreover, we delineate the current challenges and delve into new frontiers, thereby shedding light on future research. We hope this work coupled with open-source resources can provide the community with quick access and spur breakthrough research in medical hallucination.

Anthology ID:: 2025.findings-acl.350
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 6748–6769
Language:
URL:: https://aclanthology.org/2025.findings-acl.350/
DOI:: 10.18653/v1/2025.findings-acl.350
Bibkey:
Cite (ACL):: Zhihong Zhu, Yunyan Zhang, Xianwei Zhuang, Fan Zhang, Zhongwei Wan, Yuyan Chen, Qingqing Long, Yefeng Zheng, and Xian Wu. 2025. Can We Trust AI Doctors? A Survey of Medical Hallucination in Large Language and Large Vision-Language Models. In Findings of the Association for Computational Linguistics: ACL 2025, pages 6748–6769, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: Can We Trust AI Doctors? A Survey of Medical Hallucination in Large Language and Large Vision-Language Models (Zhu et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-acl.350.pdf

PDF Cite Search Fix data