Ahasis Shared Task: Hybrid Lexicon-Augmented AraBERT Model for Sentiment Detection in Arabic Dialects

Shimaa Amer Ibrahim, Mabrouka Bessghaier, Wajdi Zaghouani


Abstract
This work was conducted as part of the Ahasis@RANLP–2025 shared task, which focuses on sentiment detection in Arabic dialects within the hotel review domain. The primary objective is to advance sentiment analysis methodologies tailored to dialectal Arabic. Our work combines data augmentation with a hybrid model that integrates AraBERT and our created sentiment lexicon. Notably, our hybrid model significantly improved performance, reaching an F1-score of 0.74, compared to 0.56 when using only AraBERT. These results highlight the effectiveness of lexicon integration and augmentation strategies in enhancing both the accuracy and robustness of sentiment classification in dialectal Arabic.
Anthology ID:
2025.ranlp-ahasis.5
Volume:
Proceedings of the Shared Task on Sentiment Analysis for Arabic Dialects
Month:
September
Year:
2025
Address:
Varna, Bulgaria
Editors:
Maram Alharbi, Salmane Chafik, Saad Ezzini, Ruslan Mitkov, Tharindu Ranasinghe, Hansi Hettiarachchi
Venues:
RANLP | WS
SIG:
Publisher:
INCOMA Ltd., Shoumen, Bulgaria
Note:
Pages:
29–34
Language:
URL:
https://aclanthology.org/2025.ranlp-ahasis.5/
DOI:
Bibkey:
Cite (ACL):
Shimaa Amer Ibrahim, Mabrouka Bessghaier, and Wajdi Zaghouani. 2025. Ahasis Shared Task: Hybrid Lexicon-Augmented AraBERT Model for Sentiment Detection in Arabic Dialects. In Proceedings of the Shared Task on Sentiment Analysis for Arabic Dialects, pages 29–34, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):
Ahasis Shared Task: Hybrid Lexicon-Augmented AraBERT Model for Sentiment Detection in Arabic Dialects (Ibrahim et al., RANLP 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.ranlp-ahasis.5.pdf