A Multi-Task Biomedical Named Entity Recognition Method Based on Data Augmentation

Zhao Hui, Zhao Di, Meng Jiana, Liu Shuang, Lin Hongfei


Abstract
“The rapid development of artificial intelligence has led to an explosion of literature in the biomed-ical field, and Biomedical Named Entity Recognition (BioNER) can quickly and accurately iden-tify key information from unstructured text. This task has become an important topic to promotethe rapid development of intelligence in the biomedical field. However, in the Named EntityRecognition (NER) of the biomedical field, there are always some problems of unclear boundaryrecognition, the underutilization of hierarchical information in sentences and the scarcity of train-ing data resources. Based on this, this paper proposes a multi-task BioNER model based on dataaugmentation, using four data augmentation methods: Mention Replacement (MR), Label-wisetoken Replacement (LwTR), Shuffle Within Segments (SiS) and Synonym Replacement (SR)to increase the training data. The syntactic information is extracted by incorporating the inputsentence into the Graph Convolutional Network (GCN), and then the tag information encodedby BERT is interacted through a co-attention mechanism to obtain an interaction matrix. Subse-quently, NER is performed through boundary detection tasks and span classification tasks. Com-parative experiments with other methods are conducted on the BC5CDR and JNLPBA datasets,as well as the CCKS2017 dataset. The experimental results demonstrate the effectiveness of themodel proposed in this paper.”
Anthology ID:
2024.ccl-1.83
Volume:
Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 1: Main Conference)
Month:
July
Year:
2024
Address:
Taiyuan, China
Editors:
Maosong Sun, Jiye Liang, Xianpei Han, Zhiyuan Liu, Yulan He
Venue:
CCL
SIG:
Publisher:
Chinese Information Processing Society of China
Note:
Pages:
1075–1086
Language:
English
URL:
https://aclanthology.org/2024.ccl-1.83/
DOI:
Bibkey:
Cite (ACL):
Zhao Hui, Zhao Di, Meng Jiana, Liu Shuang, and Lin Hongfei. 2024. A Multi-Task Biomedical Named Entity Recognition Method Based on Data Augmentation. In Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 1: Main Conference), pages 1075–1086, Taiyuan, China. Chinese Information Processing Society of China.
Cite (Informal):
A Multi-Task Biomedical Named Entity Recognition Method Based on Data Augmentation (Hui et al., CCL 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.ccl-1.83.pdf