Implicitly Abusive Language – What does it actually look like and why are we not getting there?

Michael Wiegand, Josef Ruppenhofer, Elisabeth Eder


Abstract
Abusive language detection is an emerging field in natural language processing which has received a large amount of attention recently. Still the success of automatic detection is limited. Particularly, the detection of implicitly abusive language, i.e. abusive language that is not conveyed by abusive words (e.g. dumbass or scum), is not working well. In this position paper, we explain why existing datasets make learning implicit abuse difficult and what needs to be changed in the design of such datasets. Arguing for a divide-and-conquer strategy, we present a list of subtypes of implicitly abusive language and formulate research tasks and questions for future research.
Anthology ID:
2021.naacl-main.48
Volume:
Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Month:
June
Year:
2021
Address:
Online
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
576–587
Language:
URL:
https://aclanthology.org/2021.naacl-main.48
DOI:
10.18653/v1/2021.naacl-main.48
Bibkey:
Copy Citation:
PDF:
https://aclanthology.org/2021.naacl-main.48.pdf