A Unified Taxonomy of Harmful Content

Michele Banko, Brendon MacKeen, Laurie Ray


Abstract
The ability to recognize harmful content within online communities has come into focus for researchers, engineers and policy makers seeking to protect users from abuse. While the number of datasets aiming to capture forms of abuse has grown in recent years, the community has not standardized around how various harmful behaviors are defined, creating challenges for reliable moderation, modeling and evaluation. As a step towards attaining shared understanding of how online abuse may be modeled, we synthesize the most common types of abuse described by industry, policy, community and health experts into a unified typology of harmful content, with detailed criteria and exceptions for each type of abuse.
Anthology ID:
2020.alw-1.16
Volume:
Proceedings of the Fourth Workshop on Online Abuse and Harms
Month:
November
Year:
2020
Address:
Online
Venues:
ALW | EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
125–137
Language:
URL:
https://aclanthology.org/2020.alw-1.16
DOI:
10.18653/v1/2020.alw-1.16
Bibkey:
Cite (ACL):
Michele Banko, Brendon MacKeen, and Laurie Ray. 2020. A Unified Taxonomy of Harmful Content. In Proceedings of the Fourth Workshop on Online Abuse and Harms, pages 125–137, Online. Association for Computational Linguistics.
Cite (Informal):
A Unified Taxonomy of Harmful Content (Banko et al., ALW 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.alw-1.16.pdf
Video:
 https://slideslive.com/38939520