SussexAI at ArAIEval Shared Task: Mitigating Class Imbalance in Arabic Propaganda Detection

Mary Fouad, Julie Weeds


Abstract
In this paper, we are exploring mitigating class imbalancein Arabic propaganda detection. Given amultigenre text which could be a news paragraphor a tweet, the objective is to identify the propagandatechnique employed in the text along withthe exact span(s) where each technique occurs. Weapproach this task as a sequence tagging task. Weutilise AraBERT for sequence classification andimplement data augmentation and random truncationmethods to mitigate the class imbalance withinthe dataset. We demonstrate the importance ofconsidering macro-F1 as well as micro-F1 whenevaluating classifier performance in this scenario.
Anthology ID:
2024.arabicnlp-1.55
Volume:
Proceedings of The Second Arabic Natural Language Processing Conference
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Nizar Habash, Houda Bouamor, Ramy Eskander, Nadi Tomeh, Ibrahim Abu Farha, Ahmed Abdelali, Samia Touileb, Injy Hamed, Yaser Onaizan, Bashar Alhafni, Wissam Antoun, Salam Khalifa, Hatem Haddad, Imed Zitouni, Badr AlKhamissi, Rawan Almatham, Khalil Mrini
Venues:
ArabicNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
524–529
Language:
URL:
https://aclanthology.org/2024.arabicnlp-1.55
DOI:
10.18653/v1/2024.arabicnlp-1.55
Bibkey:
Cite (ACL):
Mary Fouad and Julie Weeds. 2024. SussexAI at ArAIEval Shared Task: Mitigating Class Imbalance in Arabic Propaganda Detection. In Proceedings of The Second Arabic Natural Language Processing Conference, pages 524–529, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
SussexAI at ArAIEval Shared Task: Mitigating Class Imbalance in Arabic Propaganda Detection (Fouad & Weeds, ArabicNLP-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.arabicnlp-1.55.pdf