Algorithm Alliance@LT-EDI-2024: Caste and Migration Hate Speech Detection

Saisandeep Sangeetham, Shreyamanisha Vinay, Kavin Rajan G, Abishna A, Bharathi B


Abstract
Caste and Migration speech refers to the use of language that distinguishes the offense, violence, and distress on their social, caste, and migration status. Here, caste hate speech targets the imbalance of an individual’s social status and focuses mainly on the degradation of their caste group. While the migration hate speech imposes the differences in nationality, culture, and individual status. These speeches are meant to affront the social status of these people. To detect this hate in the speech, our task on Caste and Migration Hate Speech Detection has been created which classifies human speech into genuine or stimulate categories. For this task, we used multiple classification models such as the train test split model to split the dataset into train and test data, Logistic regression, Support Vector Machine, MLP (multi-layer Perceptron) classifier, Random Forest classifier, KNN classifier, and Decision tree classification. Among these models, The SVM gave the highest macro average F1 score of 0.77 and the average accuracy for these models is around 0.75.
Anthology ID:
2024.ltedi-1.33
Volume:
Proceedings of the Fourth Workshop on Language Technology for Equality, Diversity, Inclusion
Month:
March
Year:
2024
Address:
St. Julian's, Malta
Editors:
Bharathi Raja Chakravarthi, Bharathi B, Paul Buitelaar, Thenmozhi Durairaj, György Kovács, Miguel Ángel García Cumbreras
Venues:
LTEDI | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
254–258
Language:
URL:
https://aclanthology.org/2024.ltedi-1.33
DOI:
Bibkey:
Cite (ACL):
Saisandeep Sangeetham, Shreyamanisha Vinay, Kavin Rajan G, Abishna A, and Bharathi B. 2024. Algorithm Alliance@LT-EDI-2024: Caste and Migration Hate Speech Detection. In Proceedings of the Fourth Workshop on Language Technology for Equality, Diversity, Inclusion, pages 254–258, St. Julian's, Malta. Association for Computational Linguistics.
Cite (Informal):
Algorithm Alliance@LT-EDI-2024: Caste and Migration Hate Speech Detection (Sangeetham et al., LTEDI-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.ltedi-1.33.pdf