Plumeria at SemEval-2022 Task 6: Sarcasm Detection for English and Arabic Using Transformers and Data Augmentation

Mosab Shaheen, Shubham Nigam


Abstract
This paper describes our submission to SemEval-2022 Task 6 on sarcasm detection and its five subtasks for English and Arabic. Sarcasm conveys a meaning that contradicts the literal meaning, and it is mainly found on social networks. It plays a significant role in understanding the user's intention. To detect sarcasm, we used deep learning techniques based on transformers because of their success in Natural Language Processing (NLP) without the need for feature engineering. The datasets were taken from tweets. We created new datasets by augmenting with external data or by using word embeddings and repetition of instances. We ran experiments on the datasets with different types of preprocessing, because preprocessing is crucial in this task. Our team's rank was consistent across four subtasks (fourth in three subtasks and sixth in one), whereas other teams might rank near the top in some subtasks but drastically lower in others. This implies the robustness and stability of the models and techniques we used.
Anthology ID:
2022.semeval-1.130
Volume:
Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)
Month:
July
Year:
2022
Address:
Seattle, United States
Editors:
Guy Emerson, Natalie Schluter, Gabriel Stanovsky, Ritesh Kumar, Alexis Palmer, Nathan Schneider, Siddharth Singh, Shyam Ratan
Venue:
SemEval
SIG:
SIGLEX
Publisher:
Association for Computational Linguistics
Pages:
923–937
URL:
https://aclanthology.org/2022.semeval-1.130
DOI:
10.18653/v1/2022.semeval-1.130
Cite (ACL):
Mosab Shaheen and Shubham Nigam. 2022. Plumeria at SemEval-2022 Task 6: Sarcasm Detection for English and Arabic Using Transformers and Data Augmentation. In Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022), pages 923–937, Seattle, United States. Association for Computational Linguistics.
Cite (Informal):
Plumeria at SemEval-2022 Task 6: Sarcasm Detection for English and Arabic Using Transformers and Data Augmentation (Shaheen & Nigam, SemEval 2022)
PDF:
https://aclanthology.org/2022.semeval-1.130.pdf
Video:
https://aclanthology.org/2022.semeval-1.130.mp4
Data
ArSarcasm-v2