Picking Up Where the Linguist Left Off: Mapping Morphology to Phonology through Learning the Residuals

Salam Khalifa, Abdelrahim Qaddoumi, Ellen Broselow, Owen Rambow


Abstract
Learning morphophonological mappings between the spoken form of a language and its underlying morphological structures is crucial for enriching resources for morphologically rich languages like Arabic. In this work, we focus on Egyptian Arabic as our case study and explore the integration of linguistic knowledge with a neural transformer model. Our approach involves learning to correct the residual errors from hand-crafted rules to predict the spoken form from a given underlying morphological representation. We demonstrate that using a minimal set of rules, we can effectively recover errors even in very low-resource settings.
Anthology ID:
2024.arabicnlp-1.22
Volume:
Proceedings of The Second Arabic Natural Language Processing Conference
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Nizar Habash, Houda Bouamor, Ramy Eskander, Nadi Tomeh, Ibrahim Abu Farha, Ahmed Abdelali, Samia Touileb, Injy Hamed, Yaser Onaizan, Bashar Alhafni, Wissam Antoun, Salam Khalifa, Hatem Haddad, Imed Zitouni, Badr AlKhamissi, Rawan Almatham, Khalil Mrini
Venues:
ArabicNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
258–264
Language:
URL:
https://aclanthology.org/2024.arabicnlp-1.22
DOI:
Bibkey:
Cite (ACL):
Salam Khalifa, Abdelrahim Qaddoumi, Ellen Broselow, and Owen Rambow. 2024. Picking Up Where the Linguist Left Off: Mapping Morphology to Phonology through Learning the Residuals. In Proceedings of The Second Arabic Natural Language Processing Conference, pages 258–264, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
Picking Up Where the Linguist Left Off: Mapping Morphology to Phonology through Learning the Residuals (Khalifa et al., ArabicNLP-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.arabicnlp-1.22.pdf