CoToHiLi at SIGTYP 2023: Ensemble Models for Cognate and Derivative Words Detection

Liviu P. Dinu, Ioan-Bogdan Iordache, Ana Sabina Uban


Abstract
The identification of cognates and derivatives is a fundamental process in historical linguistics, on which any further research is based. In this paper we present our contribution to the SIGTYP 2023 Shared Task on cognate and derivative detection. We propose a multi-lingual solution based on features extracted from the alignment of the orthographic and phonetic representations of the words.
Anthology ID:
2023.sigtyp-1.15
Volume:
Proceedings of the 5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP
Month:
May
Year:
2023
Address:
Dubrovnik, Croatia
Editors:
Lisa Beinborn, Koustava Goswami, Saliha Muradoğlu, Alexey Sorokin, Ritesh Kumar, Andreas Shcherbakov, Edoardo M. Ponti, Ryan Cotterell, Ekaterina Vylomova
Venue:
SIGTYP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
137–142
Language:
URL:
https://aclanthology.org/2023.sigtyp-1.15
DOI:
10.18653/v1/2023.sigtyp-1.15
Bibkey:
Cite (ACL):
Liviu P. Dinu, Ioan-Bogdan Iordache, and Ana Sabina Uban. 2023. CoToHiLi at SIGTYP 2023: Ensemble Models for Cognate and Derivative Words Detection. In Proceedings of the 5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP, pages 137–142, Dubrovnik, Croatia. Association for Computational Linguistics.
Cite (Informal):
CoToHiLi at SIGTYP 2023: Ensemble Models for Cognate and Derivative Words Detection (Dinu et al., SIGTYP 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.sigtyp-1.15.pdf