Rad-Flamingo: A Multimodal Prompt driven Radiology Report Generation Framework with Patient-Centric Explanations

Md. Tousin Akhter; Devansh Lalwani; Kshitij Sharad Jadhav; Pushpak Bhattacharyya

Rad-Flamingo: A Multimodal Prompt driven Radiology Report Generation Framework with Patient-Centric Explanations

Md. Tousin Akhter, Devansh Lalwani, Kshitij Sharad Jadhav, Pushpak Bhattacharyya

Abstract

In modern healthcare, radiology plays a pivotal role in diagnosing and managing diseases. However, the complexity of medical imaging data and the variability in interpretation can lead to inconsistencies and a lack of patient-centered insight in radiology reports. To address this challenge, a novel multimodal prompt-driven report generation framework Rad-Flamingo was developed, that integrates diverse data modalities—such as medical images, and clinical notes—to produce comprehensive and context-aware radiology reports. Our framework leverages innovative prompt engineering techniques to guide vision-language models in generating relevant information, ensuring these generated reports are not only accurate but also understandable to individual patients. A key feature of our framework is its ability to provide patient-centric explanations, offering clear and personalized insights into diagnostic findings and their implications. Additionally, we also demonstrate a synthetic data generation pipeline, to append any existing benchmark datasets’ findings and impressions with patient-centric explanation. Experimental results demonstrate that this framework’s effectiveness in enhancing report quality, improving understandability, and could foster better patient-doctor communication. This approach represents a significant step towards human-centered medical AI systems.

Anthology ID:: 2026.findings-eacl.10
Volume:: Findings of the Association for Computational Linguistics: EACL 2026
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 166–188
Language:
URL:: https://aclanthology.org/2026.findings-eacl.10/
DOI:
Bibkey:
Cite (ACL):: Md. Tousin Akhter, Devansh Lalwani, Kshitij Sharad Jadhav, and Pushpak Bhattacharyya. 2026. Rad-Flamingo: A Multimodal Prompt driven Radiology Report Generation Framework with Patient-Centric Explanations. In Findings of the Association for Computational Linguistics: EACL 2026, pages 166–188, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: Rad-Flamingo: A Multimodal Prompt driven Radiology Report Generation Framework with Patient-Centric Explanations (Akhter et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-eacl.10.pdf
Checklist:: 2026.findings-eacl.10.checklist.pdf

PDF Cite Search Checklist Fix data