Morphotactic Modeling in an Open-source Multi-dialectal Arabic Morphological Analyzer and Generator

Nizar Habash, Reham Marzouk, Christian Khairallah, Salam Khalifa


Abstract
Arabic is a morphologically rich and complex language, with numerous dialectal variants. Previous efforts on Arabic morphology modeling focused on specific variants and specific domains using a range of techniques with different degrees of linguistic modeling transparency. In this paper we propose a new approach to modeling Arabic morphology with an eye towards multi-dialectness, resource openness, and easy extensibility and use. We demonstrate our approach by modeling verbs from Standard Arabic and Egyptian Arabic, within a common framework, and with high coverage.
Anthology ID:
2022.sigmorphon-1.10
Volume:
Proceedings of the 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology
Month:
July
Year:
2022
Address:
Seattle, Washington
Editors:
Garrett Nicolai, Eleanor Chodroff
Venue:
SIGMORPHON
SIG:
SIGMORPHON
Publisher:
Association for Computational Linguistics
Note:
Pages:
92–102
Language:
URL:
https://aclanthology.org/2022.sigmorphon-1.10
DOI:
10.18653/v1/2022.sigmorphon-1.10
Bibkey:
Cite (ACL):
Nizar Habash, Reham Marzouk, Christian Khairallah, and Salam Khalifa. 2022. Morphotactic Modeling in an Open-source Multi-dialectal Arabic Morphological Analyzer and Generator. In Proceedings of the 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology, pages 92–102, Seattle, Washington. Association for Computational Linguistics.
Cite (Informal):
Morphotactic Modeling in an Open-source Multi-dialectal Arabic Morphological Analyzer and Generator (Habash et al., SIGMORPHON 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.sigmorphon-1.10.pdf
Video:
 https://aclanthology.org/2022.sigmorphon-1.10.mp4
Code
 CAMeL-Lab/camel_morph