Lidoma@LT-EDI 2024:Tamil Hate Speech Detection in Migration Discourse

M. Tash, Z. Ahani, M. Zamir, O. Kolesnikova, G. Sidorov


Abstract
The exponential rise in social media users has revolutionized information accessibility and exchange. While these platforms serve various purposes, they also harbor negative elements, including hate speech and offensive behavior. Detecting hate speech in diverse languages has garnered significant attention in Natural Language Processing (NLP). This paper addresses hate speech detection in Tamil, particularly in content related to migration and refuge, as a contribution to the Caste/migration hate speech detection shared task. Employing a Convolutional Neural Network (CNN), our model achieved an F1 score of 0.76 in identifying hate speech, showing significant potential in this domain despite the complexities encountered. We provide an overview of related research, our methodology, and insights into the diverse performances in the competition, illustrating the nuances of hate speech detection in the Tamil language.
Anthology ID:
2024.ltedi-1.20
Volume:
Proceedings of the Fourth Workshop on Language Technology for Equality, Diversity, Inclusion
Month:
March
Year:
2024
Address:
St. Julian's, Malta
Editors:
Bharathi Raja Chakravarthi, Bharathi B, Paul Buitelaar, Thenmozhi Durairaj, György Kovács, Miguel Ángel García Cumbreras
Venues:
LTEDI | WS
Publisher:
Association for Computational Linguistics
Pages:
184–189
URL:
https://aclanthology.org/2024.ltedi-1.20
Cite (ACL):
M. Tash, Z. Ahani, M. Zamir, O. Kolesnikova, and G. Sidorov. 2024. Lidoma@LT-EDI 2024:Tamil Hate Speech Detection in Migration Discourse. In Proceedings of the Fourth Workshop on Language Technology for Equality, Diversity, Inclusion, pages 184–189, St. Julian's, Malta. Association for Computational Linguistics.
Cite (Informal):
Lidoma@LT-EDI 2024:Tamil Hate Speech Detection in Migration Discourse (Tash et al., LTEDI-WS 2024)
PDF:
https://aclanthology.org/2024.ltedi-1.20.pdf