A Novel Methodology for Developing Automatic Harassment Classifiers for Twitter

Ishaan Arora; Julia Guo; Sarah Ita Levitan; Susan McGregor; Julia Hirschberg

doi:10.18653/v1/2020.alw-1.2

A Novel Methodology for Developing Automatic Harassment Classifiers for Twitter

Ishaan Arora, Julia Guo, Sarah Ita Levitan, Susan McGregor, Julia Hirschberg

Abstract

Most efforts at identifying abusive speech online rely on public corpora that have been scraped from websites using keyword-based queries or released by site or platform owners for research purposes. These are typically labeled by crowd-sourced annotators – not the targets of the abuse themselves. While this method of data collection supports fast development of machine learning classifiers, the models built on them often fail in the context of real-world harassment and abuse, which contain nuances less easily identified by non-targets. Here, we present a mixed-methods approach to create classifiers for abuse and harassment which leverages direct engagement with the target group in order to achieve high quality and ecological validity of data sets and labels, and to generate deeper insights into the key tactics of bad actors. We use women journalists’ experience on Twitter as an initial community of focus. We identify several structural mechanisms of abuse that we believe will generalize to other target communities.

Anthology ID:: 2020.alw-1.2
Volume:: Proceedings of the Fourth Workshop on Online Abuse and Harms
Month:: November
Year:: 2020
Address:: Online
Editors:: Seyi Akiwowo, Bertie Vidgen, Vinodkumar Prabhakaran, Zeerak Waseem
Venue:: ALW
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 7–15
Language:
URL:: https://aclanthology.org/2020.alw-1.2/
DOI:: 10.18653/v1/2020.alw-1.2
Bibkey:
Cite (ACL):: Ishaan Arora, Julia Guo, Sarah Ita Levitan, Susan McGregor, and Julia Hirschberg. 2020. A Novel Methodology for Developing Automatic Harassment Classifiers for Twitter. In Proceedings of the Fourth Workshop on Online Abuse and Harms, pages 7–15, Online. Association for Computational Linguistics.
Cite (Informal):: A Novel Methodology for Developing Automatic Harassment Classifiers for Twitter (Arora et al., ALW 2020)
Copy Citation:
PDF:: https://aclanthology.org/2020.alw-1.2.pdf
Video:: https://slideslive.com/38939517

PDF Cite Search Video Fix data