Exploring the Use of Foundation Models for Named Entity Recognition and Lemmatization Tasks in Slavic Languages

Gabriela Pałka, Artur Nowakowski


Abstract
This paper describes Adam Mickiewicz University’s (AMU) solution for the 4th Shared Task on SlavNER. The task involves the identification, categorization, and lemmatization of named entities in Slavic languages. Our approach involved exploring the use of foundation models for these tasks. In particular, we used models based on the popular BERT and T5 model architectures. Additionally, we used external datasets to further improve the quality of our models. Our solution obtained promising results, achieving high metrics scores in both tasks. We describe our approach and the results of our experiments in detail, showing that the method is effective for NER and lemmatization in Slavic languages. Additionally, our models for lemmatization will be available at: https://huggingface.co/amu-cai.
Anthology ID:
2023.bsnlp-1.19
Volume:
Proceedings of the 9th Workshop on Slavic Natural Language Processing 2023 (SlavicNLP 2023)
Month:
May
Year:
2023
Address:
Dubrovnik, Croatia
Editors:
Jakub Piskorski, Michał Marcińczuk, Preslav Nakov, Maciej Ogrodniczuk, Senja Pollak, Pavel Přibáň, Piotr Rybak, Josef Steinberger, Roman Yangarber
Venue:
BSNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
165–171
Language:
URL:
https://aclanthology.org/2023.bsnlp-1.19
DOI:
10.18653/v1/2023.bsnlp-1.19
Bibkey:
Cite (ACL):
Gabriela Pałka and Artur Nowakowski. 2023. Exploring the Use of Foundation Models for Named Entity Recognition and Lemmatization Tasks in Slavic Languages. In Proceedings of the 9th Workshop on Slavic Natural Language Processing 2023 (SlavicNLP 2023), pages 165–171, Dubrovnik, Croatia. Association for Computational Linguistics.
Cite (Informal):
Exploring the Use of Foundation Models for Named Entity Recognition and Lemmatization Tasks in Slavic Languages (Pałka & Nowakowski, BSNLP 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.bsnlp-1.19.pdf
Video:
 https://aclanthology.org/2023.bsnlp-1.19.mp4