Cleansing & expanding the HURTLEX(el) with a multidimensional categorization of offensive words

Vivian Stamou, Iakovi Alexiou, Antigone Klimi, Eleftheria Molou, Alexandra Saivanidou, Stella Markantonatou


Abstract
We present a cleansed version of the multilingual lexicon HURTLEX-(EL) comprising 737 offensive words of Modern Greek. We worked bottom-up in two annotation rounds and developed detailed guidelines by cross-classifying words on three dimensions: context, reference, and thematic domain. Our classification reveals a wider spectrum of thematic domains concerning the study of offensive language than previously thought Efthymiou et al. (2014) and reveals social and cultural aspects that are not included in the HURTLEX categories.
Anthology ID:
2022.woah-1.10
Volume:
Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH)
Month:
July
Year:
2022
Address:
Seattle, Washington (Hybrid)
Editors:
Kanika Narang, Aida Mostafazadeh Davani, Lambert Mathias, Bertie Vidgen, Zeerak Talat
Venue:
WOAH
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
102–108
Language:
URL:
https://aclanthology.org/2022.woah-1.10
DOI:
10.18653/v1/2022.woah-1.10
Bibkey:
Cite (ACL):
Vivian Stamou, Iakovi Alexiou, Antigone Klimi, Eleftheria Molou, Alexandra Saivanidou, and Stella Markantonatou. 2022. Cleansing & expanding the HURTLEX(el) with a multidimensional categorization of offensive words. In Proceedings of the Sixth Workshop on Online Abuse and Harms (WOAH), pages 102–108, Seattle, Washington (Hybrid). Association for Computational Linguistics.
Cite (Informal):
Cleansing & expanding the HURTLEX(el) with a multidimensional categorization of offensive words (Stamou et al., WOAH 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.woah-1.10.pdf
Video:
 https://aclanthology.org/2022.woah-1.10.mp4