CRUSH: Contextually Regularized and User anchored Self-supervised Hate speech Detection

Souvic Chakraborty, Parag Dutta, Sumegh Roychowdhury, Animesh Mukherjee


Abstract
The last decade has witnessed a surge in the interaction of people through social networking platforms. While there are several positive aspects of these social platforms, their proliferation has led them to become the breeding ground for cyber-bullying and hate speech. Recent advances in NLP have often been used to mitigate the spread of such hateful content. Since the task of hate speech detection is usually applicable in the context of social networks, we introduce CRUSH, a framework for hate speech detection using User Anchored self-supervision and contextual regularization. Our proposed approach secures ~1-12% improvement in test set metrics over best performing previous approaches on two types of tasks and multiple popular English language social networking datasets.
Anthology ID:
2022.findings-naacl.144
Volume:
Findings of the Association for Computational Linguistics: NAACL 2022
Month:
July
Year:
2022
Address:
Seattle, United States
Editors:
Marine Carpuat, Marie-Catherine de Marneffe, Ivan Vladimir Meza Ruiz
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1874–1886
Language:
URL:
https://aclanthology.org/2022.findings-naacl.144
DOI:
10.18653/v1/2022.findings-naacl.144
Bibkey:
Cite (ACL):
Souvic Chakraborty, Parag Dutta, Sumegh Roychowdhury, and Animesh Mukherjee. 2022. CRUSH: Contextually Regularized and User anchored Self-supervised Hate speech Detection. In Findings of the Association for Computational Linguistics: NAACL 2022, pages 1874–1886, Seattle, United States. Association for Computational Linguistics.
Cite (Informal):
CRUSH: Contextually Regularized and User anchored Self-supervised Hate speech Detection (Chakraborty et al., Findings 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.findings-naacl.144.pdf
Software:
 2022.findings-naacl.144.software.zip
Video:
 https://aclanthology.org/2022.findings-naacl.144.mp4
Code
 parag1604/crush
Data
HateXplainRuddit