A Morphological Analyzer for Gulf Arabic Verbs

Salam Khalifa, Sara Hassan, Nizar Habash


Abstract
We present CALIMAGLF, a Gulf Arabic morphological analyzer currently covering over 2,600 verbal lemmas. We describe in detail the process of building the analyzer starting from phonetic dictionary entries to fully inflected orthographic paradigms and associated lexicon and orthographic variants. We evaluate the coverage of CALIMA-GLF against Modern Standard Arabic and Egyptian Arabic analyzers on part of a Gulf Arabic novel. CALIMA-GLF verb analysis token recall for identifying correct POS tag outperforms both the Modern Standard Arabic and Egyptian Arabic analyzers by over 27.4% and 16.9% absolute, respectively.
Anthology ID:
W17-1305
Volume:
Proceedings of the Third Arabic Natural Language Processing Workshop
Month:
April
Year:
2017
Address:
Valencia, Spain
Editors:
Nizar Habash, Mona Diab, Kareem Darwish, Wassim El-Hajj, Hend Al-Khalifa, Houda Bouamor, Nadi Tomeh, Mahmoud El-Haj, Wajdi Zaghouani
Venue:
WANLP
SIG:
SEMITIC
Publisher:
Association for Computational Linguistics
Note:
Pages:
35–45
Language:
URL:
https://aclanthology.org/W17-1305
DOI:
10.18653/v1/W17-1305
Bibkey:
Cite (ACL):
Salam Khalifa, Sara Hassan, and Nizar Habash. 2017. A Morphological Analyzer for Gulf Arabic Verbs. In Proceedings of the Third Arabic Natural Language Processing Workshop, pages 35–45, Valencia, Spain. Association for Computational Linguistics.
Cite (Informal):
A Morphological Analyzer for Gulf Arabic Verbs (Khalifa et al., WANLP 2017)
Copy Citation:
PDF:
https://aclanthology.org/W17-1305.pdf
Data
Gumar Corpus