@inproceedings{pachinger-etal-2025-disaggregated,
title = "A Disaggregated Dataset on {E}nglish Offensiveness Containing Spans",
author = "Pachinger, Pia and
Goldzycher, Janis and
Planitzer, Anna M. and
Neidhardt, Julia and
Hanbury, Allan",
editor = "Abercrombie, Gavin and
Basile, Valerio and
Frenda, Simona and
Tonelli, Sara and
Dudy, Shiran",
booktitle = "Proceedings of the The 4th Workshop on Perspectivist Approaches to NLP",
month = nov,
year = "2025",
address = "Suzhou, China",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.nlperspectives-1.1/",
pages = "1--14",
ISBN = "979-8-89176-350-0",
abstract = "Toxicity labels at sub-document granularity and disaggregated labels lead to more nuanced and personalized toxicity classification and facilitate analysis. We re-annotate a subset of 1983 posts of the Jigsaw Toxic Comment Classification Challenge and provide disaggregated toxicity labels and spans that identify inappropriate language and targets of toxic statements. Manual analysis shows that five annotations per instance effectively capture meaningful disagreement patterns and allow for finer distinctions between genuine disagreement and that arising from annotation error or inconsistency. Our main findings are: (1) Disagreement often stems from divergent interpretations of edge-case toxicity (2) Disagreement is especially high in cases of toxic statements involving non-human targets (3) Disagreement on whether a passage consists of inappropriate language occurs not only on inherently questionable terms, but also on words that may be inappropriate in specific contexts while remaining acceptable in others (4) Transformer-based models effectively learn from aggregated data that reduces false negative classifications by being more sensitive towards minority opinions for posts to be toxic. We publish the new annotations under the CC BY 4.0 license."
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="pachinger-etal-2025-disaggregated">
<titleInfo>
<title>A Disaggregated Dataset on English Offensiveness Containing Spans</title>
</titleInfo>
<name type="personal">
<namePart type="given">Pia</namePart>
<namePart type="family">Pachinger</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Janis</namePart>
<namePart type="family">Goldzycher</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Anna</namePart>
<namePart type="given">M</namePart>
<namePart type="family">Planitzer</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Julia</namePart>
<namePart type="family">Neidhardt</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Allan</namePart>
<namePart type="family">Hanbury</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2025-11</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the 4th Workshop on Perspectivist Approaches to NLP</title>
</titleInfo>
<name type="personal">
<namePart type="given">Gavin</namePart>
<namePart type="family">Abercrombie</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Valerio</namePart>
<namePart type="family">Basile</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Simona</namePart>
<namePart type="family">Frenda</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Sara</namePart>
<namePart type="family">Tonelli</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Shiran</namePart>
<namePart type="family">Dudy</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Suzhou, China</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-350-0</identifier>
</relatedItem>
<abstract>Toxicity labels at sub-document granularity and disaggregated labels lead to more nuanced and personalized toxicity classification and facilitate analysis. We re-annotate a subset of 1,983 posts from the Jigsaw Toxic Comment Classification Challenge and provide disaggregated toxicity labels and spans that identify inappropriate language and the targets of toxic statements. Manual analysis shows that five annotations per instance effectively capture meaningful disagreement patterns and allow for finer distinctions between genuine disagreement and that arising from annotation error or inconsistency. Our main findings are: (1) Disagreement often stems from divergent interpretations of edge-case toxicity. (2) Disagreement is especially high in cases of toxic statements involving non-human targets. (3) Disagreement on whether a passage consists of inappropriate language occurs not only on inherently questionable terms, but also on words that may be inappropriate in specific contexts while remaining acceptable in others. (4) Transformer-based models effectively learn from aggregated data that reduces false negative classifications by being more sensitive towards minority opinions that consider posts to be toxic. We publish the new annotations under the CC BY 4.0 license.</abstract>
<identifier type="citekey">pachinger-etal-2025-disaggregated</identifier>
<location>
<url>https://aclanthology.org/2025.nlperspectives-1.1/</url>
</location>
<part>
<date>2025-11</date>
<extent unit="page">
<start>1</start>
<end>14</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T A Disaggregated Dataset on English Offensiveness Containing Spans
%A Pachinger, Pia
%A Goldzycher, Janis
%A Planitzer, Anna M.
%A Neidhardt, Julia
%A Hanbury, Allan
%Y Abercrombie, Gavin
%Y Basile, Valerio
%Y Frenda, Simona
%Y Tonelli, Sara
%Y Dudy, Shiran
%S Proceedings of the 4th Workshop on Perspectivist Approaches to NLP
%D 2025
%8 November
%I Association for Computational Linguistics
%C Suzhou, China
%@ 979-8-89176-350-0
%F pachinger-etal-2025-disaggregated
%X Toxicity labels at sub-document granularity and disaggregated labels lead to more nuanced and personalized toxicity classification and facilitate analysis. We re-annotate a subset of 1,983 posts from the Jigsaw Toxic Comment Classification Challenge and provide disaggregated toxicity labels and spans that identify inappropriate language and the targets of toxic statements. Manual analysis shows that five annotations per instance effectively capture meaningful disagreement patterns and allow for finer distinctions between genuine disagreement and that arising from annotation error or inconsistency. Our main findings are: (1) Disagreement often stems from divergent interpretations of edge-case toxicity. (2) Disagreement is especially high in cases of toxic statements involving non-human targets. (3) Disagreement on whether a passage consists of inappropriate language occurs not only on inherently questionable terms, but also on words that may be inappropriate in specific contexts while remaining acceptable in others. (4) Transformer-based models effectively learn from aggregated data that reduces false negative classifications by being more sensitive towards minority opinions that consider posts to be toxic. We publish the new annotations under the CC BY 4.0 license.
%U https://aclanthology.org/2025.nlperspectives-1.1/
%P 1-14
Markdown (Informal)
[A Disaggregated Dataset on English Offensiveness Containing Spans](https://aclanthology.org/2025.nlperspectives-1.1/) (Pachinger et al., NLPerspectives 2025)
ACL
Pia Pachinger, Janis Goldzycher, Anna M. Planitzer, Julia Neidhardt, and Allan Hanbury. 2025. A Disaggregated Dataset on English Offensiveness Containing Spans. In Proceedings of the 4th Workshop on Perspectivist Approaches to NLP, pages 1–14, Suzhou, China. Association for Computational Linguistics.