Aladdin-FTI @ AMIYA Three Wishes for Arabic NLP: Fidelity, Diglossia, and Multidialectal Generation

Jonathan Mutal, Perla Al Almaoui, Simon Hengchen, Pierrette Bouillon


Abstract
Arabic dialects have long been under-represented in Natural Language Processing (NLP) research due to their non-standardization and high variability, which pose challenges for computational modeling. Recent advances in the field, such as Large Language Models (LLMs), offer promising avenues to address this gap by enabling Arabic to be modeled as a pluricentric language rather than a monolithic system. This paper presents Aladdin-FTI, our submission to the AMIYA shared task. The proposed system is designed to both generate and translate dialectal Arabic (DA). Specifically, the model supports text generation in Moroccan, Egyptian, Palestinian, Syrian, and Saudi dialects, as well as bidirectional translation between these dialects, Modern Standard Arabic (MSA), and English. The code and trained model will be released upon paper acceptance.
Anthology ID:
2026.vardial-1.27
Volume:
Proceedings of the 13th Workshop on NLP for Similar Languages, Varieties and Dialects
Month:
March
Year:
2026
Address:
Rabat, Morocco
Venues:
VarDial | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
339–351
Language:
URL:
https://aclanthology.org/2026.vardial-1.27/
DOI:
Bibkey:
Cite (ACL):
Jonathan Mutal, Perla Al Almaoui, Simon Hengchen, and Pierrette Bouillon. 2026. Aladdin-FTI @ AMIYA Three Wishes for Arabic NLP: Fidelity, Diglossia, and Multidialectal Generation. In Proceedings of the 13th Workshop on NLP for Similar Languages, Varieties and Dialects, pages 339–351, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
Aladdin-FTI @ AMIYA Three Wishes for Arabic NLP: Fidelity, Diglossia, and Multidialectal Generation (Mutal et al., VarDial 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.vardial-1.27.pdf