Jibes & Delights: A Dataset of Targeted Insults and Compliments to Tackle Online Abuse

Ravsimar Sodhi, Kartikey Pant, Radhika Mamidi


Abstract
Online abuse and offensive language on social media have become widespread problems in today’s digital age. In this paper, we contribute a Reddit-based dataset, consisting of 68,159 insults and 51,102 compliments targeted at individuals instead of targeting a particular community or race. Secondly, we benchmark multiple existing state-of-the-art models for both classification and unsupervised style transfer on the dataset. Finally, we analyse the experimental results and conclude that the transfer task is challenging, requiring the models to understand the high degree of creativity exhibited in the data.
Anthology ID:
2021.woah-1.14
Volume:
Proceedings of the 5th Workshop on Online Abuse and Harms (WOAH 2021)
Month:
August
Year:
2021
Address:
Online
Venues:
ACL | IJCNLP | WOAH
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
132–139
Language:
URL:
https://aclanthology.org/2021.woah-1.14
DOI:
10.18653/v1/2021.woah-1.14
Bibkey:
Cite (ACL):
Ravsimar Sodhi, Kartikey Pant, and Radhika Mamidi. 2021. Jibes & Delights: A Dataset of Targeted Insults and Compliments to Tackle Online Abuse. In Proceedings of the 5th Workshop on Online Abuse and Harms (WOAH 2021), pages 132–139, Online. Association for Computational Linguistics.
Cite (Informal):
Jibes & Delights: A Dataset of Targeted Insults and Compliments to Tackle Online Abuse (Sodhi et al., WOAH 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.woah-1.14.pdf