Applications of BERT Models Towards Automation of Clinical Coding in Icelandic

Haraldur Orri Hauksson; Hafsteinn Einarsson

doi:10.18653/v1/2024.findings-naacl.127

Applications of BERT Models Towards Automation of Clinical Coding in Icelandic

Haraldur Orri Hauksson, Hafsteinn Einarsson

Abstract

This study explores the potential of automating clinical coding in Icelandic, a language with limited digital resources, by leveraging over 25 years of electronic health records (EHR) from the Landspitali University Hospital. Traditionally a manual and error-prone task, clinical coding is essential for patient care, billing, and research. Our research delves into the effectiveness of Transformer-based models in automating this process. We investigate various model training strategies, including continued pretraining and model adaptation, under a constrained computational budget. Our findings reveal that the best-performing model achieves competitive results in both micro and macro F1 scores, with label attention contributing significantly to its success. The study also explores the possibility of training on unlabeled data. Our research provides valuable insights into the possibilities of using NLP for clinical coding in low-resource languages, demonstrating that small countries with unique languages and well-segmented healthcare records can achieve results comparable to those in higher-resourced languages.

Anthology ID:: 2024.findings-naacl.127
Volume:: Findings of the Association for Computational Linguistics: NAACL 2024
Month:: June
Year:: 2024
Address:: Mexico City, Mexico
Editors:: Kevin Duh, Helena Gomez, Steven Bethard
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1956–1967
Language:
URL:: https://aclanthology.org/2024.findings-naacl.127/
DOI:: 10.18653/v1/2024.findings-naacl.127
Bibkey:
Cite (ACL):: Haraldur Orri Hauksson and Hafsteinn Einarsson. 2024. Applications of BERT Models Towards Automation of Clinical Coding in Icelandic. In Findings of the Association for Computational Linguistics: NAACL 2024, pages 1956–1967, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):: Applications of BERT Models Towards Automation of Clinical Coding in Icelandic (Hauksson & Einarsson, Findings 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.findings-naacl.127.pdf
Video:: https://aclanthology.org/2024.findings-naacl.127.mp4

PDF Cite Search Video Fix data