Combining Context-Free and Contextualized Representations for Arabic Sarcasm Detection and Sentiment Identification

Amey Hengle, Atharva Kshirsagar, Shaily Desai, Manisha Marathe


Abstract
Since their inception, transformer-based language models have led to impressive performance gains across multiple natural language processing tasks. For Arabic, the current state-of-the-art results on most datasets are achieved by the AraBERT language model. Notwithstanding these recent advancements, sarcasm and sentiment detection persist to be challenging tasks in Arabic, given the language’s rich morphology, linguistic disparity and dialectal variations. This paper proffers team SPPU-AASM’s submission for the WANLP ArSarcasm shared-task 2021, which centers around the sarcasm and sentiment polarity detection of Arabic tweets. The study proposes a hybrid model, combining sentence representations from AraBERT with static word vectors trained on Arabic social media corpora. The proposed system achieves a F1-sarcastic score of 0.62 and a F-PN score of 0.715 for the sarcasm and sentiment detection tasks, respectively. Simulation results show that the proposed system outperforms multiple existing approaches for both the tasks, suggesting that the amalgamation of context-free and context-dependent text representations can help capture complementary facets of word meaning in Arabic. The system ranked second and tenth in the respective sub-tasks of sarcasm detection and sentiment identification.
Anthology ID:
2021.wanlp-1.46
Volume:
Proceedings of the Sixth Arabic Natural Language Processing Workshop
Month:
April
Year:
2021
Address:
Kyiv, Ukraine (Virtual)
Editors:
Nizar Habash, Houda Bouamor, Hazem Hajj, Walid Magdy, Wajdi Zaghouani, Fethi Bougares, Nadi Tomeh, Ibrahim Abu Farha, Samia Touileb
Venue:
WANLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
357–363
Language:
URL:
https://aclanthology.org/2021.wanlp-1.46
DOI:
Bibkey:
Cite (ACL):
Amey Hengle, Atharva Kshirsagar, Shaily Desai, and Manisha Marathe. 2021. Combining Context-Free and Contextualized Representations for Arabic Sarcasm Detection and Sentiment Identification. In Proceedings of the Sixth Arabic Natural Language Processing Workshop, pages 357–363, Kyiv, Ukraine (Virtual). Association for Computational Linguistics.
Cite (Informal):
Combining Context-Free and Contextualized Representations for Arabic Sarcasm Detection and Sentiment Identification (Hengle et al., WANLP 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.wanlp-1.46.pdf
Data
ArSarcasm