Revisiting and Amending Central Kurdish Data on UniMorph 4.0

Sina Ahmadi, Aso Mahmudi


Abstract
UniMorph–the Universal Morphology project is a collaborative initiative to create and maintain morphological data and organize numerous related tasks for various language processing communities. The morphological data is provided by linguists for over 160 languages in the latest version of UniMorph 4.0. This paper sheds light on the Central Kurdish data on UniMorph 4.0 by analyzing the existing data, its fallacies, and systematic morphological errors. It also presents an approach to creating more reliable morphological data by considering various specific phenomena in Central Kurdish that have not been addressed previously, such as Izafe and several enclitics.
Anthology ID:
2023.sigmorphon-1.5
Volume:
Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Garrett Nicolai, Eleanor Chodroff, Frederic Mailhot, Çağrı Çöltekin
Venue:
SIGMORPHON
SIG:
SIGMORPHON
Publisher:
Association for Computational Linguistics
Note:
Pages:
38–48
Language:
URL:
https://aclanthology.org/2023.sigmorphon-1.5
DOI:
10.18653/v1/2023.sigmorphon-1.5
Bibkey:
Cite (ACL):
Sina Ahmadi and Aso Mahmudi. 2023. Revisiting and Amending Central Kurdish Data on UniMorph 4.0. In Proceedings of the 20th SIGMORPHON workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 38–48, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Revisiting and Amending Central Kurdish Data on UniMorph 4.0 (Ahmadi & Mahmudi, SIGMORPHON 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.sigmorphon-1.5.pdf
Video:
 https://aclanthology.org/2023.sigmorphon-1.5.mp4