IITD at SemEval-2023 Task 2: A Multi-Stage Information Retrieval Approach for Fine-Grained Named Entity Recognition

Shivani Choudhary; Niladri Chatterjee; Subir Saha

doi:10.18653/v1/2023.semeval-1.111

Abstract

MultiCoNER-II is a fine-grained Named Entity Recognition (NER) task that aims to identify ambiguous and complex named entities in multiple languages, with a small amount of contextual information available. To address this task, we propose a multi-stage information retrieval (IR) pipeline that improves the performance of language models for fine-grained NER. Our approach involves leveraging a combination of a BM25-based IR model and a language model to retrieve relevant passages from a corpus. These passages are then used to train a model that utilizes a weighted average of losses. The prediction is generated by a decoder stack that includes a projection layer and conditional random field. To demonstrate the effectiveness of our approach, we participated in the English track of the MultiCoNER-II competition. Our approach yielded promising results, which we validated through detailed analysis.
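
The first retrieval stage mentioned in the abstract relies on BM25. As a rough illustration only, the sketch below scores a handful of passages against a query with one common BM25 variant. The abstract does not give the authors' tokenizer, corpus, or parameter settings, so the function names `tokenize` and `bm25_rank`, the defaults `k1=1.5` and `b=0.75`, and the whitespace tokenization are all assumptions made for this example. The second stage (re-ranking with a language model) is not covered here.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization, chosen for brevity.
    return text.lower().split()


def bm25_rank(query, passages, k1=1.5, b=0.75, top_k=2):
    """Return the top_k (passage, score) pairs under a BM25 scoring variant.

    k1 and b are commonly used defaults, not values taken from the paper.
    """
    if not passages:
        return []

    docs = [tokenize(p) for p in passages]
    n_docs = len(docs)
    avg_len = sum(len(d) for d in docs) / n_docs

    # Document frequency: number of passages that contain each term.
    df = Counter(term for d in docs for term in set(d))

    query_terms = set(tokenize(query))
    scored = []
    for idx, doc in enumerate(docs):
        tf = Counter(doc)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            # Smoothed IDF (the +1 inside the log keeps it non-negative).
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term-frequency saturation with document-length normalization.
            denom = tf[term] + k1 * (1 - b + b * len(doc) / avg_len)
            score += idf * tf[term] * (k1 + 1) / denom
        scored.append((score, idx))

    # Highest score first; ties broken by original passage order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [(passages[i], s) for s, i in scored[:top_k]]


if __name__ == "__main__":
    corpus = [
        "Paris is the capital of France",
        "The Eiffel Tower is in Paris",
        "Python is a programming language",
    ]
    for passage, score in bm25_rank("eiffel tower", corpus):
        print(f"{score:.3f}  {passage}")
```

In a pipeline like the one the abstract describes, the passages returned by such a first stage would presumably be passed to a language-model-based ranker before being used as additional context for the NER model.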

Anthology ID: 2023.semeval-1.111
Volume: Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023)
Month: July
Year: 2023
Address: Toronto, Canada
Editors: Atul Kr. Ojha, A. Seza Doğruöz, Giovanni Da San Martino, Harish Tayyar Madabushi, Ritesh Kumar, Elisa Sartori
Venue: SemEval
SIG: SIGLEX
Publisher: Association for Computational Linguistics
Pages: 800–806
URL: https://aclanthology.org/2023.semeval-1.111/
DOI: 10.18653/v1/2023.semeval-1.111
Cite (ACL): Shivani Choudhary, Niladri Chatterjee, and Subir Saha. 2023. IITD at SemEval-2023 Task 2: A Multi-Stage Information Retrieval Approach for Fine-Grained Named Entity Recognition. In Proceedings of the 17th International Workshop on Semantic Evaluation (SemEval-2023), pages 800–806, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal): IITD at SemEval-2023 Task 2: A Multi-Stage Information Retrieval Approach for Fine-Grained Named Entity Recognition (Choudhary et al., SemEval 2023)
PDF: https://aclanthology.org/2023.semeval-1.111.pdf
