NTA at SemEval-2025 Task 11: Enhanced Multilingual Textual Multi-label Emotion Detection via Integrated Augmentation Learning

Nguyen Pham Hoang Le; An Nguyen Tran Khuong; Tram Nguyen Thi Ngoc; Thìn Đặng Văn

NTA at SemEval-2025 Task 11: Enhanced Multilingual Textual Multi-label Emotion Detection via Integrated Augmentation Learning

Nguyen Pham Hoang Le, An Nguyen Tran Khuong, Tram Nguyen Thi Ngoc, Thin Dang Van

Abstract

Emotion detection in text is crucial for various applications, but progress, especially in multi-label scenarios, is often hampered by data scarcity, particularly for low-resource languages like Emakhuwa and Tigrinya. This lack of data limits model performance and generalizability. To address this, the NTA team developed a system for SemEval-2025 Task 11, leveraging data augmentation techniques: swap, deletion, oversampling, emotion-focused synonym insertion and synonym replacement to enhance baseline models for multilingual textual multi-label emotion detection. Our proposed system achieved significantly higher macro F1-scores compared to the baseline across multiple languages, demonstrating a robust approach to tackling data scarcity. This resulted in a 17th place overall ranking on the private leaderboard, and remarkably, we achieved the highest score and became the winner in Tigrinya language, demonstrating the effectiveness of our approach in a low-resource setting.

Anthology ID:: 2025.semeval-1.237
Original:: 2025.semeval-1.237v1
Version 2:: 2025.semeval-1.237v2
Volume:: Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Sara Rosenthal, Aiala Rosá, Debanjan Ghosh, Marcos Zampieri
Venues:: SemEval | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1795–1801
Language:
URL:: https://aclanthology.org/2025.semeval-1.237/
DOI:
Bibkey:
Cite (ACL):: Nguyen Pham Hoang Le, An Nguyen Tran Khuong, Tram Nguyen Thi Ngoc, and Thin Dang Van. 2025. NTA at SemEval-2025 Task 11: Enhanced Multilingual Textual Multi-label Emotion Detection via Integrated Augmentation Learning. In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025), pages 1795–1801, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: NTA at SemEval-2025 Task 11: Enhanced Multilingual Textual Multi-label Emotion Detection via Integrated Augmentation Learning (Pham Hoang Le et al., SemEval 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.semeval-1.237.pdf

PDF (v2) PDF (v1) Cite Search Fix data