Aggression Detection in Social Media: Using Deep Neural Networks, Data Augmentation, and Pseudo Labeling

Segun Taofeek Aroyehun, Alexander Gelbukh


Abstract
With the advent of the read-write web which facilitates social interactions in online spaces, the rise of anti-social behaviour in online spaces has attracted the attention of researchers. In this paper, we address the challenge of automatically identifying aggression in social media posts. Our team, saroyehun, participated in the English track of the Aggression Detection in Social Media Shared Task. On this task, we investigate the efficacy of deep neural network models of varying complexity. Our results reveal that deep neural network models require more data points to do better than an NBSVM linear baseline based on character n-grams. Our improved deep neural network models were trained on augmented data and pseudo labeled examples. Our LSTM classifier receives a weighted macro-F1 score of 0.6425 to rank first overall on the Facebook subtask of the shared task. On the social media sub-task, our CNN-LSTM model records a weighted macro-F1 score of 0.5920 to place third overall.
Anthology ID:
W18-4411
Volume:
Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018)
Month:
August
Year:
2018
Address:
Santa Fe, New Mexico, USA
Venues:
COLING | TRAC | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
90–97
Language:
URL:
https://aclanthology.org/W18-4411
DOI:
Bibkey:
Cite (ACL):
Segun Taofeek Aroyehun and Alexander Gelbukh. 2018. Aggression Detection in Social Media: Using Deep Neural Networks, Data Augmentation, and Pseudo Labeling. In Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying (TRAC-2018), pages 90–97, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
Cite (Informal):
Aggression Detection in Social Media: Using Deep Neural Networks, Data Augmentation, and Pseudo Labeling (Aroyehun & Gelbukh, 2018)
Copy Citation:
PDF:
https://aclanthology.org/W18-4411.pdf