Arabic-Centric Large Language Models for Dialectal Arabic Sentiment Analysis Task

Salwa Saad Alahmari; Eric Atwell; Hadeel Saadany; Mohammad Alsalka

Arabic-Centric Large Language Models for Dialectal Arabic Sentiment Analysis Task

Salwa Saad Alahmari, Eric Atwell, Hadeel Saadany, Mohammad Alsalka

Abstract

This paper presents a study on sentiment anal- ysis of Dialectal Arabic (DA), with a particu- lar focus on Saudi and Moroccan (Darija) di- alects within the hospitality domain. We in- troduce a novel dataset comprising 698 Saudi Arabian proverbs annotated with sentiment polarity labels—Positive, Negative, and Neu- tral—collected from five major Saudi dialect regions: Najdi, Hijazi, Shamali, Janoubi, and Sharqawi. In addition to this, we used customer reviews for fine-tuning the CAMeLBERT-DA- SA model, which achieved a 75% F1 score in sentiment classification. To further evaluate the robustness of Arabic-centric models, we assessed the performance of three open-source large language models—Allam, ACeGPT, and Jais—in a zero-shot setting using the Ahasis shared task test set. Our results highlight the effectiveness of domain-specific fine-tuning in improving sentiment analysis performance and demonstrate the potential of Arabic-centric LLMs in zero-shot scenarios. This work con- tributes new linguistic resources and empirical insights to support ongoing research in senti- ment analysis for Arabic dialect

Anthology ID:: 2025.ranlp-ahasis.11
Volume:: Proceedings of the Shared Task on Sentiment Analysis for Arabic Dialects
Month:: September
Year:: 2025
Address:: Varna, Bulgaria
Editors:: Maram Alharbi, Salmane Chafik, Saad Ezzini, Ruslan Mitkov, Tharindu Ranasinghe, Hansi Hettiarachchi
Venues:: RANLP | WS
SIG:
Publisher:: INCOMA Ltd., Shoumen, Bulgaria
Note:
Pages:: 69–75
Language:
URL:: https://aclanthology.org/2025.ranlp-ahasis.11/
DOI:
Bibkey:
Cite (ACL):: Salwa Saad Alahmari, Eric Atwell, Hadeel Saadany, and Mohammad Alsalka. 2025. Arabic-Centric Large Language Models for Dialectal Arabic Sentiment Analysis Task. In Proceedings of the Shared Task on Sentiment Analysis for Arabic Dialects, pages 69–75, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):: Arabic-Centric Large Language Models for Dialectal Arabic Sentiment Analysis Task (Alahmari et al., RANLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.ranlp-ahasis.11.pdf

PDF Cite Search Fix data