MLlab4CS at SemEval-2023 Task 2: Named Entity Recognition in Low-resource Language Bangla Using Multilingual Language Models

Shrimon Mukherjee, Madhusudan Ghosh, Girish, Partha Basuchowdhuri


Abstract
Extracting of NERs from low-resource languages and recognizing their types is one of the important tasks in the entity extraction domain. Recently many studies have been conducted in this area of research. In our study, we introduce a system for identifying complex entities and recognizing their types from low-resource language Bangla, which was published in SemEval Task 2 MulitCoNER II 2023. For this sequence labeling task, we use a pre-trained language model built on a natural language processing framework. Our team name in this competition is MLlab4CS. Our model Muril produces a macro average F-score of 76.27%, which is a comparable result for this competition.
Anthology ID:
2023.semeval-1.192
Volume:
Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Atul Kr. Ojha, A. Seza Doğruöz, Giovanni Da San Martino, Harish Tayyar Madabushi, Ritesh Kumar, Elisa Sartori
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Note:
Pages:
1388–1394
Language:
URL:
https://aclanthology.org/2023.semeval-1.192
DOI:
10.18653/v1/2023.semeval-1.192
Bibkey:
Cite (ACL):
Shrimon Mukherjee, Madhusudan Ghosh, Girish, and Partha Basuchowdhuri. 2023. MLlab4CS at SemEval-2023 Task 2: Named Entity Recognition in Low-resource Language Bangla Using Multilingual Language Models. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 1388–1394, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
MLlab4CS at SemEval-2023 Task 2: Named Entity Recognition in Low-resource Language Bangla Using Multilingual Language Models (Mukherjee et al., SemEval 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.semeval-1.192.pdf
Video:
 https://aclanthology.org/2023.semeval-1.192.mp4