Quick and Simple Approach for Detecting Hate Speech in Arabic Tweets

Abeer Abuzayed, Tamer Elsayed


Abstract
As social media platforms are increasingly used to communicate freely and share opinions, hate speech has become a prominent problem that requires urgent attention. This paper focuses on detecting hate speech in Arabic tweets. To tackle the problem efficiently, we adopt a “quick and simple” approach in which we investigate the effectiveness of 15 classical (e.g., SVM) and neural (e.g., CNN) learning models while exploring two different term representations. Our experiments on a labelled dataset of 8k tweets show that the best neural learning models outperform the classical ones, and that distributed term representation is more effective than statistical bag-of-words representation. Overall, our best classifier (which combines a CNN and an RNN in a joint architecture) achieved a macro-F1 score of 0.73 on the dev set, significantly outperforming the majority-class baseline, which achieves 0.49, and demonstrating the effectiveness of our “quick and simple” approach.
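To make the baseline figure concrete, the sketch below computes macro-F1 for a majority-class predictor on an imbalanced binary task. The 5% positive rate and the 1,000-example size are assumptions chosen for illustration only; the abstract does not state the class distribution, and the helper names are hypothetical rather than taken from the paper.

```python
def f1_for_class(y_true, y_pred, label):
    """F1 score for one class, treating `label` as the positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == label and p == label)
    fp = sum(1 for t, p in pairs if t != label and p == label)
    fn = sum(1 for t, p in pairs if t == label and p != label)
    denom = 2 * tp + fp + fn
    # By convention, F1 is 0 when the class is never predicted or never present.
    return 2 * tp / denom if denom else 0.0


def macro_f1(y_true, y_pred):
    """Unweighted mean of the per-class F1 scores."""
    labels = sorted(set(y_true) | set(y_pred))
    return sum(f1_for_class(y_true, y_pred, lab) for lab in labels) / len(labels)


# Assumed toy data: 5% hate speech (label 1), 95% not hate speech (label 0).
y_true = [1] * 50 + [0] * 950

# Majority-class baseline: always predict the most frequent label.
majority_label = max(set(y_true), key=y_true.count)
y_pred = [majority_label] * len(y_true)

baseline = macro_f1(y_true, y_pred)
print(f"majority-class macro-F1: {baseline:.2f}")
```

Because macro-F1 averages the per-class scores without weighting, the minority class contributes an F1 of zero here, so a majority-class predictor stays just under 0.5 under this assumed split no matter how high its accuracy is.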
Anthology ID:
2020.osact-1.18
Volume:
Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection
Month:
May
Year:
2020
Address:
Marseille, France
Editors:
Hend Al-Khalifa, Walid Magdy, Kareem Darwish, Tamer Elsayed, Hamdy Mubarak
Venue:
OSACT
Publisher:
European Language Resource Association
Pages:
109–114
Language:
English
URL:
https://aclanthology.org/2020.osact-1.18
Cite (ACL):
Abeer Abuzayed and Tamer Elsayed. 2020. Quick and Simple Approach for Detecting Hate Speech in Arabic Tweets. In Proceedings of the 4th Workshop on Open-Source Arabic Corpora and Processing Tools, with a Shared Task on Offensive Language Detection, pages 109–114, Marseille, France. European Language Resource Association.
Cite (Informal):
Quick and Simple Approach for Detecting Hate Speech in Arabic Tweets (Abuzayed & Elsayed, OSACT 2020)
PDF:
https://aclanthology.org/2020.osact-1.18.pdf