RA-RRG: Multimodal Retrieval-Augmented Radiology Report Generation with Key Phrase Extraction

Jonggwon Park; Byungmu Yoon; Soobum Kim; Kyoyun Choi

Abstract

Automated radiology report generation (RRG) holds potential to reduce the workload of radiologists, and recent advances in multimodal large language models (MLLMs) have enabled multimodal chest X-ray (CXR) report generation. However, existing MLLMs are computationally expensive, require large-scale training data, and may produce hallucinated content, limiting their practical deployment. To address these limitations, we propose RA-RRG, a retrieval-augmented RRG framework that combines multimodal retrieval with large language models (LLMs) to generate radiology reports while reducing hallucinations and computational demands. RA-RRG uses LLMs to extract clinically essential key phrases from radiology reports and retrieves relevant phrases given an input image. By conditioning LLMs on the retrieved phrases, RA-RRG effectively suppresses hallucinations while maintaining strong report generation performance. Experiments on the MIMIC-CXR and IU X-ray datasets show state-of-the-art results on CheXbert metrics and competitive RadGraph F1 scores compared to MLLMs. Furthermore, RA-RRG naturally generalizes to multi-view RRG by aggregating phrases retrieved from multiple images, highlighting its broad applicability to real-world clinical scenarios. Code is available at https://github.com/deepnoid-ai/RA-RRG.
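The retrieve-then-generate flow described in the abstract can be pictured with a minimal sketch. This is not the authors' implementation (see the linked repository for that). It assumes that images and key phrases are already embedded in a shared vector space, uses toy 3-dimensional vectors in place of real encoder outputs, and models multi-view aggregation as an order-preserving union of the phrases retrieved per view. The names `retrieve_phrases`, `aggregate_views`, `build_prompt`, and `PHRASE_BANK` are hypothetical and chosen only for illustration.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0

def retrieve_phrases(image_emb, phrase_bank, k=2):
    """Return the k phrases whose embeddings are most similar to one image."""
    ranked = sorted(phrase_bank,
                    key=lambda item: cosine(image_emb, item[1]),
                    reverse=True)
    return [phrase for phrase, _ in ranked[:k]]

def aggregate_views(view_embs, phrase_bank, k=2):
    """Assumed aggregation: union of per-view retrievals, first-seen order."""
    seen, merged = set(), []
    for emb in view_embs:
        for phrase in retrieve_phrases(emb, phrase_bank, k):
            if phrase not in seen:
                seen.add(phrase)
                merged.append(phrase)
    return merged

def build_prompt(phrases):
    """Build a text prompt that conditions an LLM on the retrieved phrases."""
    bullet_list = "\n".join(f"- {p}" for p in phrases)
    return ("Write a chest X-ray report using only these findings:\n"
            + bullet_list)

# Toy phrase bank: (key phrase, made-up embedding). Real embeddings would
# come from trained image and text encoders.
PHRASE_BANK = [
    ("no pleural effusion", [1.0, 0.0, 0.0]),
    ("cardiomegaly", [0.0, 1.0, 0.0]),
    ("no pneumothorax", [0.9, 0.1, 0.0]),
    ("left lower lobe opacity", [0.0, 0.0, 1.0]),
]

frontal = [1.0, 0.05, 0.0]   # made-up embedding of a frontal view
lateral = [0.0, 0.1, 1.0]    # made-up embedding of a lateral view

phrases = aggregate_views([frontal, lateral], PHRASE_BANK, k=2)
prompt = build_prompt(phrases)
print(prompt)
```

In this sketch the LLM sees only text retrieved for the input images, which reflects the abstract's idea of conditioning generation on retrieved phrases to suppress hallucinations. The prompt wording and the choice of `k` are illustrative assumptions.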

Anthology ID: 2026.findings-acl.247
Volume: Findings of the Association for Computational Linguistics: ACL 2026
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 5029–5048
URL: https://aclanthology.org/2026.findings-acl.247/
Cite (ACL): Jonggwon Park, Byungmu Yoon, Soobum Kim, and Kyoyun Choi. 2026. RA-RRG: Multimodal Retrieval-Augmented Radiology Report Generation with Key Phrase Extraction. In Findings of the Association for Computational Linguistics: ACL 2026, pages 5029–5048, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): RA-RRG: Multimodal Retrieval-Augmented Radiology Report Generation with Key Phrase Extraction (Park et al., Findings 2026)
PDF: https://aclanthology.org/2026.findings-acl.247.pdf
Checklist: 2026.findings-acl.247.checklist.pdf
