Leveraging Offensive Language for Sarcasm and Sentiment Detection in Arabic

Fatemah Husain, Ozlem Uzuner


Abstract
Sarcasm detection is one of the top challenging tasks in text classification, particularly for informal Arabic with high syntactic and semantic ambiguity. We propose two systems that harness knowledge from multiple tasks to improve the performance of the classifier. This paper presents the systems used in our participation to the two sub-tasks of the Sixth Arabic Natural Language Processing Workshop (WANLP); Sarcasm Detection and Sentiment Analysis. Our methodology is driven by the hypothesis that tweets with negative sentiment and tweets with sarcasm content are more likely to have offensive content, thus, fine-tuning the classification model using large corpus of offensive language, supports the learning process of the model to effectively detect sentiment and sarcasm contents. Results demonstrate the effectiveness of our approach for sarcasm detection task over sentiment analysis task.
Anthology ID:
2021.wanlp-1.47
Volume:
Proceedings of the Sixth Arabic Natural Language Processing Workshop
Month:
April
Year:
2021
Address:
Kyiv, Ukraine (Virtual)
Editors:
Nizar Habash, Houda Bouamor, Hazem Hajj, Walid Magdy, Wajdi Zaghouani, Fethi Bougares, Nadi Tomeh, Ibrahim Abu Farha, Samia Touileb
Venue:
WANLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
364–369
Language:
URL:
https://aclanthology.org/2021.wanlp-1.47
DOI:
Bibkey:
Cite (ACL):
Fatemah Husain and Ozlem Uzuner. 2021. Leveraging Offensive Language for Sarcasm and Sentiment Detection in Arabic. In Proceedings of the Sixth Arabic Natural Language Processing Workshop, pages 364–369, Kyiv, Ukraine (Virtual). Association for Computational Linguistics.
Cite (Informal):
Leveraging Offensive Language for Sarcasm and Sentiment Detection in Arabic (Husain & Uzuner, WANLP 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.wanlp-1.47.pdf