IIITDWD@TamilNLP-ACL2022: Transformer-based approach to classify abusive content in Dravidian Code-mixed text

Shankar Biradar; Sunil Saumya

doi:10.18653/v1/2022.dravidianlangtech-1.16

Abstract

Identifying abusive content and hate speech in social media text has attracted growing interest from the research community in recent years, driven largely by the widespread use of social media platforms. Detecting abusive content in low-resource regional languages is, in turn, an important research problem in computational linguistics. As part of ACL 2022, the organizers of DravidianLangTech@ACL 2022 released a shared task on abusive category identification in Tamil and Tamil-English code-mixed text to encourage further research on offensive content identification in low-resource Indic languages. This paper presents the working notes for the model submitted by IIITDWD at DravidianLangTech@ACL 2022. Our team competed in Sub-Task B and finished in 9th place among the participating teams. In our proposed approach, we used a pre-trained transformer model, Indic-bert, for feature extraction, with an SVM classifier on top of it for stance detection. Our model achieved 62% accuracy on code-mixed Tamil-English text.
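
The pipeline described in the abstract (transformer features fed to an SVM) can be sketched roughly as follows. This is an illustrative sketch, not the authors' code: the checkpoint id `ai4bharat/indic-bert`, the mean-pooling step, the `max_length` of 128, and the RBF kernel are all assumptions, since the abstract does not specify them. The function names `mean_pool`, `embed`, and `train_svm` are hypothetical.

```python
import numpy as np
from sklearn.svm import SVC

# Assumed Hugging Face checkpoint id; the abstract only says "Indic-bert".
MODEL_NAME = "ai4bharat/indic-bert"


def mean_pool(hidden, mask):
    """Average token vectors per sentence, ignoring padded positions.

    hidden: array of shape (batch, tokens, dim)
    mask:   array of shape (batch, tokens), 1 for real tokens, 0 for padding
    """
    mask = mask[..., None].astype(hidden.dtype)
    return (hidden * mask).sum(axis=1) / np.clip(mask.sum(axis=1), 1e-9, None)


def embed(texts, model_name=MODEL_NAME):
    """Turn raw texts into fixed-size vectors with a frozen transformer.

    Heavy dependencies are imported here so the rest of the sketch can be
    used without downloading a model. Pooling strategy is an assumption.
    """
    import torch
    from transformers import AutoModel, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModel.from_pretrained(model_name).eval()
    batch = tokenizer(list(texts), padding=True, truncation=True,
                      max_length=128, return_tensors="pt")
    with torch.no_grad():
        hidden = model(**batch).last_hidden_state
    return mean_pool(hidden.numpy(), batch["attention_mask"].numpy())


def train_svm(features, labels):
    """Fit an SVM on the extracted features (kernel choice is an assumption)."""
    return SVC(kernel="rbf").fit(features, labels)
```

Under these assumptions, training would look like `clf = train_svm(embed(train_texts), train_labels)`, and prediction like `clf.predict(embed(test_texts))`, where `train_texts`, `train_labels`, and `test_texts` are placeholders for the shared-task data.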

Anthology ID:: 2022.dravidianlangtech-1.16
Volume:: Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages
Month:: May
Year:: 2022
Address:: Dublin, Ireland
Editors:: Bharathi Raja Chakravarthi, Ruba Priyadharshini, Anand Kumar Madasamy, Parameswari Krishnamurthy, Elizabeth Sherly, Sinnathamby Mahesan
Venue:: DravidianLangTech
Publisher:: Association for Computational Linguistics
Pages:: 100–104
URL:: https://aclanthology.org/2022.dravidianlangtech-1.16/
DOI:: 10.18653/v1/2022.dravidianlangtech-1.16
Cite (ACL):: Shankar Biradar and Sunil Saumya. 2022. IIITDWD@TamilNLP-ACL2022: Transformer-based approach to classify abusive content in Dravidian Code-mixed text. In Proceedings of the Second Workshop on Speech and Language Technologies for Dravidian Languages, pages 100–104, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):: IIITDWD@TamilNLP-ACL2022: Transformer-based approach to classify abusive content in Dravidian Code-mixed text (Biradar & Saumya, DravidianLangTech 2022)
PDF:: https://aclanthology.org/2022.dravidianlangtech-1.16.pdf
Video:: https://aclanthology.org/2022.dravidianlangtech-1.16.mp4