DRAGON at FIGNEWS 2024 Shared Task: a Dedicated RAG for October 7th conflict News

Sadegh Jafari; Mohsen Mahmoodzadeh; Vanooshe Nazari; Razieh Bahmanyar; Kathryn Burrows

doi:10.18653/v1/2024.arabicnlp-1.58

DRAGON at FIGNEWS 2024 Shared Task: a Dedicated RAG for October 7th conflict News

Sadegh Jafari, Mohsen Mahmoodzadeh, Vanooshe Nazari, Razieh Bahmanyar, Kathryn Burrows

Abstract

In this study, we present a novel approach to annotating bias and propaganda in social media data by leveraging topic modeling techniques. Utilizing the BERTopic tool, we performed topic modeling on the FIGNEWS Shared-task dataset, which initially comprised 13,500 samples. From this dataset, we identified 35 distinct topics and selected approximately 50 representative samples from each topic, resulting in a subset of 1,812 samples. These selected samples were meticulously annotated for bias and propaganda labels. Subsequently, we employed multiple methods like KNN, SVC, XGBoost, and RAG to develop a classifier capable of detecting bias and propaganda within social media content. Our approach demonstrates the efficacy of using topic modeling for efficient data subset selection and provides a robust foundation for improving the accuracy of bias and propaganda detection in large-scale social media datasets.

Anthology ID:: 2024.arabicnlp-1.58
Volume:: Proceedings of the Second Arabic Natural Language Processing Conference
Month:: August
Year:: 2024
Address:: Bangkok, Thailand
Editors:: Nizar Habash, Houda Bouamor, Ramy Eskander, Nadi Tomeh, Ibrahim Abu Farha, Ahmed Abdelali, Samia Touileb, Injy Hamed, Yaser Onaizan, Bashar Alhafni, Wissam Antoun, Salam Khalifa, Hatem Haddad, Imed Zitouni, Badr AlKhamissi, Rawan Almatham, Khalil Mrini
Venues:: ArabicNLP | WS
SIG:: SIGARAB
Publisher:: Association for Computational Linguistics
Note:
Pages:: 555–560
Language:
URL:: https://aclanthology.org/2024.arabicnlp-1.58/
DOI:: 10.18653/v1/2024.arabicnlp-1.58
Bibkey:
Cite (ACL):: Sadegh Jafari, Mohsen Mahmoodzadeh, Vanooshe Nazari, Razieh Bahmanyar, and Kathryn Burrows. 2024. DRAGON at FIGNEWS 2024 Shared Task: a Dedicated RAG for October 7th conflict News. In Proceedings of the Second Arabic Natural Language Processing Conference, pages 555–560, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):: DRAGON at FIGNEWS 2024 Shared Task: a Dedicated RAG for October 7th conflict News (Jafari et al., ArabicNLP 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.arabicnlp-1.58.pdf

PDF Cite Search Fix data