MUCS@LT-EDI2023: Learning Approaches for Hope Speech Detection in Social Media Text

Asha Hegde, Kavya G, Sharal Coelho, Hosahalli Lakshmaiah Shashirekha


Abstract
Hope plays a significant role in shaping human thoughts and actions and hope content has received limited attention in the realm of social media data analysis. The exploration of hope content helps to uncover the valuable insights into users’ aspirations, expectations, and emotional states. By delving into the analysis of hope content on social media platforms, researchers and analysts can gain a deeper understanding of how hope influences individuals’ behaviors, decisions, and overall well-being in the digital age. However, this area is rarely explored even for resource-high languages. To address the identification of hope text in social media platforms, this paper describes the models submitted by the team MUCS to “Hope Speech Detection for Equality, Diversity, and Inclusion (LT-EDI)” shared task organized at Recent Advances in Natural Language Processing (RANLP) - 2023. This shared task aims to classify a comment/post in English and code-mixed texts in three languages, namely, Bulgarian, Spanish, and Hindi into one of the two predefined categories, namely, “Hope speech” and “Non Hope speech”. Two models, namely: i) Hope_BERT - Linear Support Vector Classifier (LinearSVC) model trained by combining Bidirectional Encoder Representations from Transformers (BERT) embeddings and Term Frequency-Inverse Document Frequency (TF-IDF) of character n-grams with word boundary (char_wb) for English and ii) Hope_mBERT - LinearSVC model trained by combining Multilingual BERT (mBERT) embeddings and TF-IDF of char_wb for Bulgarian, Spanish, and Hindi code-mixed texts are proposed for the shared task to classify the given text into Hope or Non-Hope categories. The proposed models obtained 1st, 1st, 2nd, and 5th ranks for Spanish, Bulgarian, Hindi, and English texts respectively.
Anthology ID:
2023.ltedi-1.43
Volume:
Proceedings of the Third Workshop on Language Technology for Equality, Diversity and Inclusion
Month:
September
Year:
2023
Address:
Varna, Bulgaria
Editors:
Bharathi R. Chakravarthi, B. Bharathi, Joephine Griffith, Kalika Bali, Paul Buitelaar
Venues:
LTEDI | WS
SIG:
Publisher:
INCOMA Ltd., Shoumen, Bulgaria
Note:
Pages:
279–286
Language:
URL:
https://aclanthology.org/2023.ltedi-1.43
DOI:
Bibkey:
Cite (ACL):
Asha Hegde, Kavya G, Sharal Coelho, and Hosahalli Lakshmaiah Shashirekha. 2023. MUCS@LT-EDI2023: Learning Approaches for Hope Speech Detection in Social Media Text. In Proceedings of the Third Workshop on Language Technology for Equality, Diversity and Inclusion, pages 279–286, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
Cite (Informal):
MUCS@LT-EDI2023: Learning Approaches for Hope Speech Detection in Social Media Text (Hegde et al., LTEDI-WS 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.ltedi-1.43.pdf