NUS-IDS at AMIYA/VarDial 2026: Improving Arabic Dialectness in LLMs with Reinforcement Learning

Sujatha Das Gollapalli; Mouad Hakam; Mingzhe Du; See Kiong Ng

NUS-IDS at AMIYA/VarDial 2026: Improving Arabic Dialectness in LLMs with Reinforcement Learning

Sujatha Das Gollapalli, Mouad Hakam, Mingzhe Du, See-Kiong Ng

Abstract

In this paper, we describe models developed by our team, NUS-IDS, for the Closed data track at the Arabic Modeling In Your Accent (AMIYA) shared task at VarDial 2026. The core idea behind our solution involves data augmentation enabled by a dialect classifier trained on AMIYA data. We effectively combine various translation, summarization, and question answering prompts with AMIYA training data to form dialectal prompts for use with state-of-the-art LLMs. Next, dialect predictions from our classifier on outputs from these LLMs are used to compile preference data for Reinforcement Learning (RL). We report model performance on dialectal Arabic from Egypt, Morocco, Palestine, Saudi Arabia and Syria using FLORES+, a multilingual machine translation dataset. Our experiments illustrate that though our RL models show significant performance gains on dialectness scores, they under perform on translation metrics such as chrF++ compared to base LLMs.

Anthology ID:: 2026.vardial-1.30
Volume:: Proceedings of the 13th Workshop on NLP for Similar Languages, Varieties and Dialects
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Venues:: VarDial | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 365–372
Language:
URL:: https://aclanthology.org/2026.vardial-1.30/
DOI:
Bibkey:
Cite (ACL):: Sujatha Das Gollapalli, Mouad Hakam, Mingzhe Du, and See-Kiong Ng. 2026. NUS-IDS at AMIYA/VarDial 2026: Improving Arabic Dialectness in LLMs with Reinforcement Learning. In Proceedings of the 13th Workshop on NLP for Similar Languages, Varieties and Dialects, pages 365–372, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: NUS-IDS at AMIYA/VarDial 2026: Improving Arabic Dialectness in LLMs with Reinforcement Learning (Gollapalli et al., VarDial 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.vardial-1.30.pdf

PDF Cite Search Fix data