A Fine-Grained Taxonomy of Replies to Hate Speech

Xinchen Yu, Ashley Zhao, Eduardo Blanco, Lingzi Hong


Abstract
Countering rather than censoring hate speech has emerged as a promising strategy to address hatred. There are many types of counterspeech in user-generated content: addressing the hateful content or its author, generic requests, well-reasoned counter arguments, insults, etc. The effectiveness of counterspeech, which we define as subsequent incivility, depends on these types. In this paper, we present a theoretically grounded taxonomy of replies to hate speech and a new corpus. We work with real, user-generated hate speech and all the replies it elicits rather than replies generated by a third party. Our analyses provide insights into the content real users reply with as well as which replies are empirically most effective. We also experiment with models to characterize the replies to hate speech, thereby opening the door to estimating whether a reply to hate speech will result in further incivility.
Anthology ID:
2023.emnlp-main.450
Volume:
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Month:
December
Year:
2023
Address:
Singapore
Editors:
Houda Bouamor, Juan Pino, Kalika Bali
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
7275–7289
Language:
URL:
https://aclanthology.org/2023.emnlp-main.450
DOI:
10.18653/v1/2023.emnlp-main.450
Bibkey:
Cite (ACL):
Xinchen Yu, Ashley Zhao, Eduardo Blanco, and Lingzi Hong. 2023. A Fine-Grained Taxonomy of Replies to Hate Speech. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 7275–7289, Singapore. Association for Computational Linguistics.
Cite (Informal):
A Fine-Grained Taxonomy of Replies to Hate Speech (Yu et al., EMNLP 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.emnlp-main.450.pdf
Video:
 https://aclanthology.org/2023.emnlp-main.450.mp4