ArSarcasMoji Dataset: The Emoji Sentiment Roles in Arabic Ironic Contexts

Shatha Ali A. Hakami, Robert Hendley, Phillip Smith


Abstract
In digital communication, emoji are essential in decoding nuances such as irony, sarcasm, and humour. However, their incorporation in Arabic natural language processing (NLP) has been cautious because of the perceived complexities of the Arabic language. This paper introduces ArSarcasMoji, a dataset of 24,630 emoji-augmented texts, with 17. 5% that shows irony. Through our analysis, we highlight specific emoji patterns paired with sentiment roles that denote irony in Arabic texts. The research counters prevailing notions, emphasising the importance of emoji’s role in understanding Arabic textual irony, and addresses their potential for accurate irony detection in Arabic digital content.
Anthology ID:
2023.arabicnlp-1.18
Volume:
Proceedings of ArabicNLP 2023
Month:
December
Year:
2023
Address:
Singapore (Hybrid)
Editors:
Hassan Sawaf, Samhaa El-Beltagy, Wajdi Zaghouani, Walid Magdy, Ahmed Abdelali, Nadi Tomeh, Ibrahim Abu Farha, Nizar Habash, Salam Khalifa, Amr Keleg, Hatem Haddad, Imed Zitouni, Khalil Mrini, Rawan Almatham
Venues:
ArabicNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
208–217
Language:
URL:
https://aclanthology.org/2023.arabicnlp-1.18
DOI:
10.18653/v1/2023.arabicnlp-1.18
Bibkey:
Cite (ACL):
Shatha Ali A. Hakami, Robert Hendley, and Phillip Smith. 2023. ArSarcasMoji Dataset: The Emoji Sentiment Roles in Arabic Ironic Contexts. In Proceedings of ArabicNLP 2023, pages 208–217, Singapore (Hybrid). Association for Computational Linguistics.
Cite (Informal):
ArSarcasMoji Dataset: The Emoji Sentiment Roles in Arabic Ironic Contexts (Hakami et al., ArabicNLP-WS 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.arabicnlp-1.18.pdf