Implicitly Abusive Comparisons – A New Dataset and Linguistic Analysis

Michael Wiegand, Maja Geulig, Josef Ruppenhofer


Abstract
We examine the task of detecting implicitly abusive comparisons (e.g. “Your hair looks like you have been electrocuted”). Implicitly abusive comparisons are abusive comparisons in which abusive words (e.g. “dumbass” or “scum”) are absent. We detail the process of creating a novel dataset for this task via crowdsourcing that includes several measures to obtain a sufficiently representative and unbiased set of comparisons. We also present classification experiments that include a range of linguistic features that help us better understand the mechanisms underlying abusive comparisons.
Anthology ID:
2021.eacl-main.27
Volume:
Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume
Month:
April
Year:
2021
Address:
Online
Editors:
Paola Merlo, Jorg Tiedemann, Reut Tsarfaty
Venue:
EACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
358–368
Language:
URL:
https://aclanthology.org/2021.eacl-main.27
DOI:
10.18653/v1/2021.eacl-main.27
Bibkey:
Cite (ACL):
Michael Wiegand, Maja Geulig, and Josef Ruppenhofer. 2021. Implicitly Abusive Comparisons – A New Dataset and Linguistic Analysis. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, pages 358–368, Online. Association for Computational Linguistics.
Cite (Informal):
Implicitly Abusive Comparisons – A New Dataset and Linguistic Analysis (Wiegand et al., EACL 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.eacl-main.27.pdf
Code
 miwieg/implicitly_abusive_comparisons