Haobin Yuan
2025
System Report for CCL25-Eval Task 8: Improving ICD Coding with Large Language Models via Disease Entity Recognition
Tengxiao Lv | Juntao Li | Chao Liu | Haobin Yuan | Ling Luo | Jian Wang | Hongfei Lin
Proceedings of the 24th China National Conference on Computational Linguistics (CCL 2025)
Tengxiao Lv | Juntao Li | Chao Liu | Haobin Yuan | Ling Luo | Jian Wang | Hongfei Lin
Proceedings of the 24th China National Conference on Computational Linguistics (CCL 2025)
"With the widespread adoption of Electronic Medical Records (EMRs), automated coding of theInternational Classification of Diseases (ICD) has become increasingly essential. However, the complexity of Chinese clinical texts presents significant challenges to traditional methods. To address these issues, CCL25-Eval Task 8 organized the Chinese EMRs ICD Diagnosis CodingEvaluation. This paper presents a method based on Large Language Models (LLMs), which divides the task into primary and other diagnosis coding. For the primary diagnosis, a confidence-guided semantic retrieval strategy is applied, while ensemble learning enhanced with NamedEntity Recognition (NER) is used for other diagnoses. The proposed approach achieved 83.42%accuracy on the official test set, ranking second in the evaluation."