girlsteam@LT-EDI-2025: Caste/Migration based hate speech Detection

Towshin Hossain Tushi; Walisa Alam; Rehenuma Ilman; Samia Rahman

girlsteam@LT-EDI-2025: Caste/Migration based hate speech Detection

Towshin Hossain Tushi, Walisa Alam, Rehenuma Ilman, Samia Rahman

Abstract

The proliferation of caste- and migration-based hate speech on social media poses a significant challenge, particularly in low-resource languages like Tamil. This paper presents our approach to the LT-EDI@ACL 2025 shared task, addressing this issue through a hybrid transformer-based framework. We explore a range of Machine Learning (ML), Deep Learning (DL), and multilingual transformer models, culminating in a novel m-BERT+BiLSTM hybrid architecture. This model integrates contextual embeddings from m-BERT with lexical features from TF-IDF and FastText, feeding the enriched representations into a BiLSTM to capture bidirectional semantic dependencies. Empirical results demonstrate the superiority of this hybrid architecture, achieving a macro-F1 score of 0.76 on the test set and surpassing the performance of standalone models such as MuRIL and IndicBERT. These results affirm the effectiveness of hybrid multilingual models for hate speech detection in low-resource and culturally complex linguistic settings.

Anthology ID:: 2025.ltedi-1.29
Volume:: Proceedings of the 5th Conference on Language, Data and Knowledge: Fifth Workshop on Language Technology for Equality, Diversity, Inclusion
Month:: September
Year:: 2025
Address:: Naples, Italy
Editors:: Katerina Gkirtzou, Slavko Žitnik, Jorge Gracia, Dagmar Gromann, Maria Pia di Buono, Johanna Monti, Maxim Ionov
Venues:: LTEDI | WS
SIG:
Publisher:: Unior Press
Note:
Pages:: 178–183
Language:
URL:: https://aclanthology.org/2025.ltedi-1.29/
DOI:
Bibkey:
Cite (ACL):: Towshin Hossain Tushi, Walisa Alam, Rehenuma Ilman, and Samia Rahman. 2025. girlsteam@LT-EDI-2025: Caste/Migration based hate speech Detection. In Proceedings of the 5th Conference on Language, Data and Knowledge: Fifth Workshop on Language Technology for Equality, Diversity, Inclusion, pages 178–183, Naples, Italy. Unior Press.
Cite (Informal):: girlsteam@LT-EDI-2025: Caste/Migration based hate speech Detection (Tushi et al., LTEDI 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.ltedi-1.29.pdf

PDF Cite Search Fix data