cs@DravidianLangTech-EACL2021: Offensive Language Identification Based On Multilingual BERT Model

Shi Chen, Bing Kong


Abstract
This paper introduces the related content of the task “Offensive Language Identification in Dravidian LANGUAGES-EACL 2021”. The task requires us to classify Dravidian languages collected from social media into Not-Offensive, Off-Untargeted, Off-Target-Individual, etc. This data set contains actual annotations in code-mixed text posted by users on Youtube, not from the monolingual text in textbooks. Based on the features of the data set code mixture, we use multilingual BERT and TextCNN for semantic extraction and text classification. In this article, we will show the experiment and result analysis of this task.
Anthology ID:
2021.dravidianlangtech-1.31
Volume:
Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages
Month:
April
Year:
2021
Address:
Kyiv
Editors:
Bharathi Raja Chakravarthi, Ruba Priyadharshini, Anand Kumar M, Parameswari Krishnamurthy, Elizabeth Sherly
Venue:
DravidianLangTech
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
230–235
Language:
URL:
https://aclanthology.org/2021.dravidianlangtech-1.31
DOI:
Bibkey:
Cite (ACL):
Shi Chen and Bing Kong. 2021. cs@DravidianLangTech-EACL2021: Offensive Language Identification Based On Multilingual BERT Model. In Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages, pages 230–235, Kyiv. Association for Computational Linguistics.
Cite (Informal):
cs@DravidianLangTech-EACL2021: Offensive Language Identification Based On Multilingual BERT Model (Chen & Kong, DravidianLangTech 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.dravidianlangtech-1.31.pdf
Software:
 2021.dravidianlangtech-1.31.Software.zip