A Machine Learning Framework for Detecting Hate Speech and Fake Narratives in Hindi-English Tweets

R.n. Yadawad, Sunil Saumya, K.n. Nivedh, Siddhaling S. Padanur, Sudev Basti


Abstract
This paper presents a novel system developed for the Faux-Hate Shared Task at ICON2024, addressing the detection of hate speechand fake narratives within Hindi-English code-mixed social media data. Our approach com-bines advanced text preprocessing, TF-IDFvectorization, and Random Forest classifiersto identify harmful content, while employingSMOTE to address class imbalance. By lever-aging ensemble learning and feature engineer-ing, our system demonstrates robust perfor-mance in detecting hateful and fake content,classifying targets, and evaluating the sever-ity of hate speech. The results underscore thepotential for real-world applications, such asmoderating online platforms and identifyingharmful narratives. Furthermore, we highlightethical considerations for deploying such tools,emphasizing responsible use in sensitive do-mains, thereby advancing research in multilin-gual hate speech detection and online abusemitigation.
Anthology ID:
2024.icon-fauxhate.8
Volume:
Proceedings of the 21st International Conference on Natural Language Processing (ICON): Shared Task on Decoding Fake Narratives in Spreading Hateful Stories (Faux-Hate)
Month:
December
Year:
2024
Address:
AU-KBC Research Centre, Chennai, India
Editors:
Shankar Biradar, Kasu Sai Kartheek Reddy, Sunil Saumya, Md. Shad Akhtar
Venue:
ICON
SIG:
SIGLEX
Publisher:
NLP Association of India (NLPAI)
Note:
Pages:
40–44
Language:
URL:
https://aclanthology.org/2024.icon-fauxhate.8/
DOI:
Bibkey:
Cite (ACL):
R.n. Yadawad, Sunil Saumya, K.n. Nivedh, Siddhaling S. Padanur, and Sudev Basti. 2024. A Machine Learning Framework for Detecting Hate Speech and Fake Narratives in Hindi-English Tweets. In Proceedings of the 21st International Conference on Natural Language Processing (ICON): Shared Task on Decoding Fake Narratives in Spreading Hateful Stories (Faux-Hate), pages 40–44, AU-KBC Research Centre, Chennai, India. NLP Association of India (NLPAI).
Cite (Informal):
A Machine Learning Framework for Detecting Hate Speech and Fake Narratives in Hindi-English Tweets (Yadawad et al., ICON 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.icon-fauxhate.8.pdf