SCU-MESCLab at ROCLING-2022 Shared Task: Named Entity Recognition Using BERT Classifier

Tsung-Hsien Yang, Ruei-Cyuan Su, Tzu-En Su, Sing-Seong Chong, Ming-Hsiang Su


Abstract
In this study, named entity recognition is constructed and applied in the medical domain. Data is labeled in BIO format. For example, “muscle” would be labeled “B-BODY” and “I-BODY”, and “cough” would be “B-SYMP” and “I-SYMP”. All words outside the category are marked with “O”. The Chinese HealthNER Corpus contains 30,692 sentences, of which 2531 sentences are divided into the validation set (dev) for this evaluation, and the conference finally provides another 3204 sentences for the test set (test). We use BLSTM_CRF, Roberta+BLSTM_CRF and BERT Classifier to submit three prediction results respectively. Finally, the BERT Classifier system submitted as RUN3 achieved the best prediction performance, with an accuracy of 80.18%, a recall rate of 78.3%, and an F1-score of 79.23.
Anthology ID:
2022.rocling-1.41
Volume:
Proceedings of the 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022)
Month:
November
Year:
2022
Address:
Taipei, Taiwan
Editors:
Yung-Chun Chang, Yi-Chin Huang
Venue:
ROCLING
SIG:
Publisher:
The Association for Computational Linguistics and Chinese Language Processing (ACLCLP)
Note:
Pages:
329–334
Language:
Chinese
URL:
https://aclanthology.org/2022.rocling-1.41
DOI:
Bibkey:
Cite (ACL):
Tsung-Hsien Yang, Ruei-Cyuan Su, Tzu-En Su, Sing-Seong Chong, and Ming-Hsiang Su. 2022. SCU-MESCLab at ROCLING-2022 Shared Task: Named Entity Recognition Using BERT Classifier. In Proceedings of the 34th Conference on Computational Linguistics and Speech Processing (ROCLING 2022), pages 329–334, Taipei, Taiwan. The Association for Computational Linguistics and Chinese Language Processing (ACLCLP).
Cite (Informal):
SCU-MESCLab at ROCLING-2022 Shared Task: Named Entity Recognition Using BERT Classifier (Yang et al., ROCLING 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.rocling-1.41.pdf