Named Entity Annotation Projection Applied to Classical Languages

Tariq Yousef, Chiara Palladino, Gerhard Heyer, Stefan Jänicke


Abstract
In this study, we demonstrate how to apply cross-lingual annotation projection to transfer named-entity annotations to classical languages for which limited or no resources and annotated texts are available, aiming to enrich their NER training datasets and train a model to perform NER tagging. Our method uses sentence-level aligned parallel corpora of ancient texts and their translations in a modern language for which high-quality off-the-shelf NER systems are available. We automatically annotate the modern-language text and employ a state-of-the-art neural word alignment system to find translation equivalents. Finally, we transfer the annotations to the corresponding tokens in the ancient texts using a direct projection heuristic. We applied our method to ancient Greek, Latin, and Arabic using the Bible with the English translation as a parallel corpus. We used the resulting annotations to enhance the performance of an existing NER model for ancient Greek.
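The final projection step described in the abstract can be illustrated with a minimal sketch. The abstract does not spell out the exact heuristic, so the code below shows one common form of direct projection and is not the authors' implementation: it assumes BIO-style tags on the modern-language tokens and word alignments given as (source index, target index) pairs. The function name `project_entities`, the toy English/Latin sentence pair, and the alignment pairs are all hypothetical and chosen only for illustration.

```python
from typing import List, Optional, Tuple


def project_entities(
    src_tags: List[str],
    alignments: List[Tuple[int, int]],
    tgt_len: int,
) -> List[str]:
    """Project BIO tags from source tokens onto aligned target tokens.

    Assumption: each target token takes the entity type of the first
    aligned source token that carries an entity tag.
    """
    # Step 1: copy the entity type (without the B-/I- prefix) across each link.
    types: List[Optional[str]] = [None] * tgt_len
    for src_idx, tgt_idx in alignments:
        tag = src_tags[src_idx]
        if tag != "O" and types[tgt_idx] is None:
            types[tgt_idx] = tag.split("-", 1)[1]

    # Step 2: rebuild BIO prefixes in target word order, because word order
    # (and therefore span boundaries) can differ between the two languages.
    projected: List[str] = []
    prev: Optional[str] = None
    for ent_type in types:
        if ent_type is None:
            projected.append("O")
        elif ent_type == prev:
            projected.append("I-" + ent_type)
        else:
            projected.append("B-" + ent_type)
        prev = ent_type
    return projected


# Toy example (hypothetical data, not taken from the paper's corpus).
english_tokens = ["Jesus", "went", "to", "Jerusalem"]
english_tags = ["B-PER", "O", "O", "B-LOC"]
latin_tokens = ["Iesus", "ivit", "Hierosolymam"]
links = [(0, 0), (1, 1), (3, 2)]  # output of a word aligner, assumed given

print(project_entities(english_tags, links, len(latin_tokens)))
# → ['B-PER', 'O', 'B-LOC']
```

One limitation of this simplified version: two distinct entities of the same type that end up adjacent in the target sentence are merged into a single span, and unaligned target tokens always receive `O`.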
Anthology ID:
2023.latechclfl-1.19
Volume:
Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature
Month:
May
Year:
2023
Address:
Dubrovnik, Croatia
Editors:
Stefania Degaetano-Ortlieb, Anna Kazantseva, Nils Reiter, Stan Szpakowicz
Venue:
LaTeCHCLfL
Publisher:
Association for Computational Linguistics
Pages:
175–182
URL:
https://aclanthology.org/2023.latechclfl-1.19
DOI:
10.18653/v1/2023.latechclfl-1.19
Cite (ACL):
Tariq Yousef, Chiara Palladino, Gerhard Heyer, and Stefan Jänicke. 2023. Named Entity Annotation Projection Applied to Classical Languages. In Proceedings of the 7th Joint SIGHUM Workshop on Computational Linguistics for Cultural Heritage, Social Sciences, Humanities and Literature, pages 175–182, Dubrovnik, Croatia. Association for Computational Linguistics.
Cite (Informal):
Named Entity Annotation Projection Applied to Classical Languages (Yousef et al., LaTeCHCLfL 2023)
PDF:
https://aclanthology.org/2023.latechclfl-1.19.pdf
Video:
https://aclanthology.org/2023.latechclfl-1.19.mp4