BpHigh at WASSA 2023: Using Contrastive Learning to build Sentence Transformer models for Multi-Class Emotion Classification in Code-mixed Urdu

Bhavish Pahwa

doi:10.18653/v1/2023.wassa-1.59

Abstract

In this era of digital communication and social media, texting and chatting mainly take place in code-mixed or Romanized versions of the native language prevalent in a region. The prevalence of Romanized and code-mixed text creates a need for NLP systems that can make use of this digital content across various use cases. This paper describes our contribution to the MCEC subtask of the WASSA 2023 Shared Task on Multi-Label and Multi-Class Emotion Classification on Code-Mixed Text Messages. We explore how to build sentence transformer models for low-resource languages from unlabeled data, using the contrastive learning techniques described in the SimCSE paper, and then use the resulting sentence transformer to build classification models with the SetFit approach. We will also publish our code and models on GitHub and HuggingFace, two open-source hosting services.
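As an illustration of the contrastive objective mentioned above, here is a minimal sketch of the unsupervised SimCSE loss in plain Python. It is not the author's code: the function names, the toy two-dimensional embeddings, and the temperature of 0.05 are assumptions made for this example. In unsupervised SimCSE, each sentence is encoded twice with different dropout masks; the two encodings of the same sentence form the positive pair, and the encodings of the other sentences in the batch act as negatives.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def simcse_loss(view_a, view_b, temperature=0.05):
    """Mean in-batch contrastive (InfoNCE) loss.

    view_a[i] and view_b[i] are assumed to be two embeddings of the same
    sentence (for example, two forward passes with different dropout masks).
    Every view_b[j] with j != i serves as a negative for view_a[i].
    """
    losses = []
    for i, anchor in enumerate(view_a):
        logits = [cosine(anchor, other) / temperature for other in view_b]
        # Log-sum-exp with the maximum subtracted for numerical stability.
        max_logit = max(logits)
        log_denominator = max_logit + math.log(
            sum(math.exp(value - max_logit) for value in logits)
        )
        # Negative log-probability of picking the true positive.
        losses.append(log_denominator - logits[i])
    return sum(losses) / len(losses)


# Hypothetical toy embeddings standing in for encoder outputs.
aligned_a = [[1.0, 0.0], [0.0, 1.0]]
aligned_b = [[0.9, 0.1], [0.1, 0.9]]
shuffled_b = [aligned_b[1], aligned_b[0]]  # positives deliberately mismatched

loss_aligned = simcse_loss(aligned_a, aligned_b)
loss_shuffled = simcse_loss(aligned_a, shuffled_b)
```

The loss is small when each sentence is closest to its own second view and large when the pairs are mismatched, which is the signal that pulls the two views of a sentence together during training. In practice the embeddings would come from a transformer encoder and the loss would be computed on tensors so that gradients can flow back to the model.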

Anthology ID: 2023.wassa-1.59
Volume: Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis
Month: July
Year: 2023
Address: Toronto, Canada
Editors: Jeremy Barnes, Orphée De Clercq, Roman Klinger
Venue: WASSA
Publisher: Association for Computational Linguistics
Pages: 606–610
URL: https://aclanthology.org/2023.wassa-1.59
DOI: 10.18653/v1/2023.wassa-1.59
Cite (ACL): Bhavish Pahwa. 2023. BpHigh at WASSA 2023: Using Contrastive Learning to build Sentence Transformer models for Multi-Class Emotion Classification in Code-mixed Urdu. In Proceedings of the 13th Workshop on Computational Approaches to Subjectivity, Sentiment, & Social Media Analysis, pages 606–610, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal): BpHigh at WASSA 2023: Using Contrastive Learning to build Sentence Transformer models for Multi-Class Emotion Classification in Code-mixed Urdu (Pahwa, WASSA 2023)
PDF: https://aclanthology.org/2023.wassa-1.59.pdf
Video: https://aclanthology.org/2023.wassa-1.59.mp4
