Unsupervised Domain Adaptation in Cross-corpora Abusive Language Detection

Tulika Bose, Irina Illina, Dominique Fohr


Abstract
State-of-the-art abusive language detection models report strong in-corpus performance, but underperform when evaluated on abusive comments that differ from the training scenario. As human annotation involves substantial time and effort, models that can adapt to newly collected comments can prove to be useful. In this paper, we investigate the effectiveness of several Unsupervised Domain Adaptation (UDA) approaches for the task of cross-corpora abusive language detection. In comparison, we adapt a variant of the BERT model, trained on large-scale abusive comments, using Masked Language Model (MLM) fine-tuning. Our evaluation shows that the UDA approaches result in sub-optimal performance, while MLM fine-tuning does better in the cross-corpora setting. Detailed analysis reveals the limitations of the UDA approaches and emphasizes the need to build efficient adaptation methods for this task.
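The abstract refers to adapting a BERT variant via Masked Language Model (MLM) fine-tuning on unlabeled target-corpus comments. The sketch below is an illustrative recipe for that general step using the Hugging Face transformers and datasets libraries; it is not the authors' released code, and the base checkpoint name, comments, and hyperparameters are placeholders.

```python
# Minimal sketch (assumed setup, not the paper's code): domain-adaptive
# MLM fine-tuning of a BERT-style encoder on unlabeled target-corpus text.
from datasets import Dataset
from transformers import (
    AutoTokenizer,
    AutoModelForMaskedLM,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

# Hypothetical unlabeled comments from the target corpus.
target_comments = [
    "example unlabeled comment from the target corpus",
    "another comment whose label is unknown at adaptation time",
]

# Placeholder checkpoint; the paper adapts a BERT variant pre-trained
# on large-scale abusive comments.
model_name = "bert-base-uncased"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForMaskedLM.from_pretrained(model_name)

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=128)

dataset = Dataset.from_dict({"text": target_comments}).map(
    tokenize, batched=True, remove_columns=["text"]
)

# Dynamic masking of 15% of tokens, as in standard MLM pre-training.
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm_probability=0.15)

trainer = Trainer(
    model=model,
    args=TrainingArguments(
        output_dir="mlm-adapted",
        num_train_epochs=3,
        per_device_train_batch_size=16,
        learning_rate=5e-5,
    ),
    train_dataset=dataset,
    data_collator=collator,
)
trainer.train()

# The adapted encoder would then be fine-tuned with a classification head
# on labeled source-corpus data for abusive language detection.
```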
Anthology ID:
2021.socialnlp-1.10
Volume:
Proceedings of the Ninth International Workshop on Natural Language Processing for Social Media
Month:
June
Year:
2021
Address:
Online
Editors:
Lun-Wei Ku, Cheng-Te Li
Venue:
SocialNLP
Publisher:
Association for Computational Linguistics
Pages:
113–122
URL:
https://aclanthology.org/2021.socialnlp-1.10
DOI:
10.18653/v1/2021.socialnlp-1.10
Cite (ACL):
Tulika Bose, Irina Illina, and Dominique Fohr. 2021. Unsupervised Domain Adaptation in Cross-corpora Abusive Language Detection. In Proceedings of the Ninth International Workshop on Natural Language Processing for Social Media, pages 113–122, Online. Association for Computational Linguistics.
Cite (Informal):
Unsupervised Domain Adaptation in Cross-corpora Abusive Language Detection (Bose et al., SocialNLP 2021)
PDF:
https://aclanthology.org/2021.socialnlp-1.10.pdf
Data
Hate Speech and Offensive Language