Learning from Textual Radiology Reports: A Benchmark Dataset for Coronary CT Angiography

Sudharshan Balaji; Zhiyu Liu; Zhengyuan Jiang; Shuo Lei; Yimin Chen; Yang Xiao; Shone O. Almeida; Mathew Joseph Karivelil; Christopher Malanga; Ning Wang

Learning from Textual Radiology Reports: A Benchmark Dataset for Coronary CT Angiography

Sudharshan Balaji, Zhiyu Liu, Zhengyuan Jiang, Shuo Lei, Yimin Chen, Yang Xiao, Shone O. Almeida, Mathew Joseph Karivelil, Christopher Malanga, Ning Wang

Abstract

While coronary imaging is widely used for anatomical assessment, CCTA reports play a distinct last-mile role in clinical care. Ratherthan serving as an intermediate signal, CCTA provides an assessment of coronary disease severity (known as the CAD-RADS score) toguide patient management. However, real-world clinical text exhibits substantial heterogeneity in terminology and structure, leadingto inconsistent interpretation by automated systems, even for clinically similar cases. Recent work leverages a direct application ofLLMs for automated CAD-RADS scoring, but is limited by small, non-public, and homogeneous clinical data. We introduce CCTA-RADS, the largest publicly available dataset of 940 real-world CCTA reports from a major cardiovascular center, each annotated with CAD-RADS scores. Our analysis reveals that direct approaches, including state-of-the-art LLMs (GPT-4o, GPT-o3) and fine-tuned BERT models underperform on diverse real-world clinical data. To address these limitations, we propose a two-stage pipeline that decouples structuring from classification: an LLM-based parser normalizes heterogeneous reports into structured format, followed by fine-tuned BERT classification. This approach substantially improves the F1-score by 6%-13% compared with direct methods. We deploy our system as an interactive web interface that allows clinicians to upload CCTA reports for automated CAD-RADS assessment with SHAP and LIME explainability visualizations.

Anthology ID:: 2026.acl-industry.33
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Yunyao Li, Georg Rehm, Mei Tu
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 480–493
Language:
URL:: https://aclanthology.org/2026.acl-industry.33/
DOI:
Bibkey:
Cite (ACL):: Sudharshan Balaji, Zhiyu Liu, Zhengyuan Jiang, Shuo Lei, Yimin Chen, Yang Xiao, Shone O. Almeida, Mathew Joseph Karivelil, Christopher Malanga, and Ning Wang. 2026. Learning from Textual Radiology Reports: A Benchmark Dataset for Coronary CT Angiography. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 480–493, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: Learning from Textual Radiology Reports: A Benchmark Dataset for Coronary CT Angiography (Balaji et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-industry.33.pdf

PDF Cite Search Fix data