HARMONY@DravidianLangTech: Transformer-based Ensemble Learning for Abusive Comment Detection

Amrish Raaj P, Abirami Murugappan, Lysa Packiam R S, Deivamani M


Abstract
Millions of posts and comments are created every minute as a result of the widespread use of social media and easy access to the internet. It is essential to create an inclusive environment and to forbid the use of abusive language against any individual or group of individuals. This paper describes the approach of team HARMONY for the “Abusive Comment Detection” shared task at the Third Workshop on Speech and Language Technologies for Dravidian Languages. A Transformer-based ensemble learning approach is proposed for detecting abusive comments in code-mixed (Tamil-English) and Tamil text. The proposed architecture ranked 2nd in the Tamil text classification subtask and 3rd in the code-mixed text classification subtask, with macro-F1 scores of 0.41 for Tamil and 0.50 for code-mixed data.
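The abstract does not say how the individual Transformer models are combined, so the following is only a minimal sketch, assuming hard majority voting over per-model label predictions, together with the standard macro-F1 metric used to report the results. The function names `majority_vote` and `macro_f1` and the labels `"a"` and `"b"` are hypothetical placeholders, not the shared task's label set or the authors' code.

```python
from collections import Counter


def majority_vote(predictions_per_model):
    """Combine per-model label lists by hard majority vote.

    Assumption: every model predicts one label per comment, and ties are
    broken in favour of the earliest model in the list.
    """
    combined = []
    for votes in zip(*predictions_per_model):
        counts = Counter(votes)
        top = max(counts.values())
        # Pick the first label (in model order) that reaches the top count.
        combined.append(next(v for v in votes if counts[v] == top))
    return combined


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all labels seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        # A class with no true or predicted instances contributes 0.
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


if __name__ == "__main__":
    # Three hypothetical models voting on three comments.
    model_outputs = [["a", "b", "a"], ["a", "a", "b"], ["b", "b", "b"]]
    ensemble = majority_vote(model_outputs)
    print(ensemble)
    print(macro_f1(["a", "b", "b"], ensemble))
```

Because macro-F1 averages per-class scores without weighting by class frequency, rare abuse categories count as much as frequent ones, which is one reason scores such as 0.41 and 0.50 can look low on imbalanced multi-class data.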
Anthology ID:
2023.dravidianlangtech-1.21
Volume:
Proceedings of the Third Workshop on Speech and Language Technologies for Dravidian Languages
Month:
September
Year:
2023
Address:
Varna, Bulgaria
Editors:
Bharathi R. Chakravarthi, Ruba Priyadharshini, Anand Kumar M, Sajeetha Thavareesan, Elizabeth Sherly
Venues:
DravidianLangTech | WS
Publisher:
INCOMA Ltd., Shoumen, Bulgaria
Pages:
160–165
URL:
https://aclanthology.org/2023.dravidianlangtech-1.21
Cite (ACL):
Amrish Raaj P, Abirami Murugappan, Lysa Packiam R S, and Deivamani M. 2023. HARMONY@DravidianLangTech: Transformer-based Ensemble Learning for Abusive Comment Detection. In Proceedings of the Third Workshop on Speech and Language Technologies for Dravidian Languages, pages 160–165, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):
HARMONY@DravidianLangTech: Transformer-based Ensemble Learning for Abusive Comment Detection (Raaj P et al., DravidianLangTech-WS 2023)
PDF:
https://aclanthology.org/2023.dravidianlangtech-1.21.pdf