A Comparison between Named Entity Recognition Models in the Biomedical Domain

Maria Carmela Cariello, Alessandro Lenci, Ruslan Mitkov


Abstract
The domain-specialised application of Named Entity Recognition (NER) is known as Biomedical NER (BioNER), which aims to identify and classify biomedical concepts that are of interest to researchers, such as genes, proteins, chemical compounds, drugs, mutations, diseases, and so on. The BioNER task is very similar to general NER but recognising Biomedical Named Entities (BNEs) is more challenging than recognising proper names from newspapers due to the characteristics of biomedical nomenclature. In order to address the challenges posed by BioNER, seven machine learning models were implemented comparing a transfer learning approach based on fine-tuned BERT with Bi-LSTM based neural models and a CRF model used as baseline. Precision, Recall and F1-score were used as performance scores evaluating the models on two well-known biomedical corpora: JNLPBA and BIOCREATIVE IV (BC-IV). Strict and partial matching were considered as evaluation criteria. The reported results show that a transfer learning approach based on fine-tuned BERT outperforms all others methods achieving the highest scores for all metrics on both corpora.
Anthology ID:
2021.triton-1.9
Volume:
Proceedings of the Translation and Interpreting Technology Online Conference
Month:
July
Year:
2021
Address:
Held Online
Editors:
Ruslan Mitkov, Vilelmini Sosoni, Julie Christine Giguère, Elena Murgolo, Elizabeth Deysel
Venue:
TRITON
SIG:
Publisher:
INCOMA Ltd.
Note:
Pages:
76–84
Language:
URL:
https://aclanthology.org/2021.triton-1.9
DOI:
Bibkey:
Cite (ACL):
Maria Carmela Cariello, Alessandro Lenci, and Ruslan Mitkov. 2021. A Comparison between Named Entity Recognition Models in the Biomedical Domain. In Proceedings of the Translation and Interpreting Technology Online Conference, pages 76–84, Held Online. INCOMA Ltd..
Cite (Informal):
A Comparison between Named Entity Recognition Models in the Biomedical Domain (Cariello et al., TRITON 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.triton-1.9.pdf
Code
 cariello1/bioner