The Anatomy of Evidence: An Investigation Into Explainable ICD Coding

Katharina Beckh; Elisa Studeny; Sujan Sai Gannamaneni; Dario Antweiler; Stefan Rueping

doi:10.18653/v1/2025.findings-acl.864

The Anatomy of Evidence: An Investigation Into Explainable ICD Coding

Katharina Beckh, Elisa Studeny, Sujan Sai Gannamaneni, Dario Antweiler, Stefan Rueping

Abstract

Automatic medical coding has the potential to ease documentation and billing processes. For this task, transparency plays an important role for medical coders and regulatory bodies, which can be achieved using explainability methods. However, the evaluation of these approaches has been mostly limited to short text and binary settings due to a scarcity of annotated data. Recent efforts by Cheng et al. (2023) have introduced the MDACE dataset, which provides a valuable resource containing code evidence in clinical records. In this work, we conduct an in-depth analysis of the MDACE dataset and perform plausibility evaluation of current explainable medical coding systems from an applied perspective. With this, we contribute to a deeper understanding of automatic medical coding and evidence extraction. Our findings reveal that ground truth evidence aligns with code descriptions to a certain degree. An investigation into state-of-the-art approaches shows a high overlap with ground truth evidence. We propose match measures and highlight success and failure cases. Based on our findings, we provide recommendations for developing and evaluating explainable medical coding systems.

Anthology ID:: 2025.findings-acl.864
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 16840–16851
Language:
URL:: https://aclanthology.org/2025.findings-acl.864/
DOI:: 10.18653/v1/2025.findings-acl.864
Bibkey:
Cite (ACL):: Katharina Beckh, Elisa Studeny, Sujan Sai Gannamaneni, Dario Antweiler, and Stefan Rueping. 2025. The Anatomy of Evidence: An Investigation Into Explainable ICD Coding. In Findings of the Association for Computational Linguistics: ACL 2025, pages 16840–16851, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: The Anatomy of Evidence: An Investigation Into Explainable ICD Coding (Beckh et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-acl.864.pdf

PDF Cite Search Fix data