AAST-NLP at ArAIEval Shared Task: Tackling Persuasion technique and Disinformation Detection using Pre-Trained Language Models On Imbalanced Datasets

Ahmed El-Sayed; Omar Nasr; Noureldin Elmadany

doi:10.18653/v1/2023.arabicnlp-1.56

AAST-NLP at ArAIEval Shared Task: Tackling Persuasion technique and Disinformation Detection using Pre-Trained Language Models On Imbalanced Datasets

Ahmed El-Sayed, Omar Nasr, Noureldin Elmadany

Abstract

This paper presents the pipeline developed by the AAST-NLP team to address both the persuasion technique detection and disinformation detection shared tasks. The proposed system for all the tasks’ sub-tasks consisted of preprocessing the data and finetuning AraBERT on the given datasets, in addition to several procedures performed for each subtask to adapt to the problems faced in it. The previously described system was used in addition to Dice loss as the loss function for sub-task 1A, which consisted of a binary classification problem. In that sub-task, the system came in eleventh place. We trained the AraBERT for task 1B, which was a multi-label problem with 24 distinct labels, using binary cross-entropy to train a classifier for each label. On that sub-task, the system came in third place. We utilised AraBERT with Dice loss on both subtasks 2A and 2B, ranking second and third among the proposed models for the respective subtasks.

Anthology ID:: 2023.arabicnlp-1.56
Volume:: Proceedings of ArabicNLP 2023
Month:: December
Year:: 2023
Address:: Singapore (Hybrid)
Editors:: Hassan Sawaf, Samhaa El-Beltagy, Wajdi Zaghouani, Walid Magdy, Ahmed Abdelali, Nadi Tomeh, Ibrahim Abu Farha, Nizar Habash, Salam Khalifa, Amr Keleg, Hatem Haddad, Imed Zitouni, Khalil Mrini, Rawan Almatham
Venues:: ArabicNLP | WS
SIG:: SIGARAB
Publisher:: Association for Computational Linguistics
Note:
Pages:: 565–569
Language:
URL:: https://aclanthology.org/2023.arabicnlp-1.56/
DOI:: 10.18653/v1/2023.arabicnlp-1.56
Bibkey:
Cite (ACL):: Ahmed El-Sayed, Omar Nasr, and Noureldin Elmadany. 2023. AAST-NLP at ArAIEval Shared Task: Tackling Persuasion technique and Disinformation Detection using Pre-Trained Language Models On Imbalanced Datasets. In Proceedings of ArabicNLP 2023, pages 565–569, Singapore (Hybrid). Association for Computational Linguistics.
Cite (Informal):: AAST-NLP at ArAIEval Shared Task: Tackling Persuasion technique and Disinformation Detection using Pre-Trained Language Models On Imbalanced Datasets (El-Sayed et al., ArabicNLP 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.arabicnlp-1.56.pdf
Video:: https://aclanthology.org/2023.arabicnlp-1.56.mp4

PDF Cite Search Video Fix data