Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding

Zheng Yuan, Chuanqi Tan, Songfang Huang


Abstract
Automatic ICD coding is defined as assigning disease codes to electronic medical records (EMRs).Existing methods usually apply label attention with code representations to match related text snippets. Unlike these works that model the label with the code hierarchy or description, we argue that the code synonyms can provide more comprehensive knowledge based on the observation that the code expressions in EMRs vary from their descriptions in ICD. By aligning codes to concepts in UMLS, we collect synonyms of every code. Then, we propose a multiple synonyms matching network to leverage synonyms for better code representation learning, and finally help the code classification. Experiments on the MIMIC-III dataset show that our proposed method outperforms previous state-of-the-art methods.
Anthology ID:
2022.acl-short.91
Volume:
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Month:
May
Year:
2022
Address:
Dublin, Ireland
Editors:
Smaranda Muresan, Preslav Nakov, Aline Villavicencio
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
808–814
Language:
URL:
https://aclanthology.org/2022.acl-short.91
DOI:
10.18653/v1/2022.acl-short.91
Bibkey:
Cite (ACL):
Zheng Yuan, Chuanqi Tan, and Songfang Huang. 2022. Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 808–814, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):
Code Synonyms Do Matter: Multiple Synonyms Matching Network for Automatic ICD Coding (Yuan et al., ACL 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.acl-short.91.pdf
Software:
 2022.acl-short.91.software.zip
Code
 ganjinzero/icd-msmn
Data
MIMIC-III