REGLAT at AbjadGenEval: Multi-Model Ensemble Approach for Arabic AI-Generated Text Detection

Mariam Labib Francies, Nsrin Ashraf, Ahmed Megahed Fetouh, Hamada Nayel


Abstract
The rapid advancement of large language models necessitates robust methods for detecting AI-generated Arabic text. This paper presents our system for distinguishing human-written from machine-generated Arabic content. We propose a weighted ensemble combining AraBERTv2 and BERT-base-arabic, trained via 5-fold stratified cross-validation with class-balanced loss functions. Our methodology incorporates Arabic text normalization, strategic data augmentation using 16,678 samples from external scientific abstracts, and threshold optimization prioritizing recall. On the official test set, our system achieved an F1-score of 0.763, an accuracy of 0.695, a precision of 0.624, and a recall of 0.980, demonstrating strong detection of machine-generated texts with minimal false negatives at the cost of elevated false positives. Analysis reveals critical insights into precision-recall trade-offs and challenges in cross-domain generalization for Arabic AI text detection.
Anthology ID:
2026.abjadnlp-1.62
Volume:
Proceedings of the 2nd Workshop on NLP for Languages Using Arabic Script
Month:
March
Year:
2026
Address:
Rabat, Morocco
Venues:
AbjadNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
493–496
Language:
URL:
https://aclanthology.org/2026.abjadnlp-1.62/
DOI:
Bibkey:
Cite (ACL):
Mariam Labib Francies, Nsrin Ashraf, Ahmed Megahed Fetouh, and Hamada Nayel. 2026. REGLAT at AbjadGenEval: Multi-Model Ensemble Approach for Arabic AI-Generated Text Detection. In Proceedings of the 2nd Workshop on NLP for Languages Using Arabic Script, pages 493–496, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
REGLAT at AbjadGenEval: Multi-Model Ensemble Approach for Arabic AI-Generated Text Detection (Francies et al., AbjadNLP 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.abjadnlp-1.62.pdf