Mapping Sentiments: A Journey into Low-Resource Luxembourgish Analysis

Nina Hosseini-Kivanani, Julien Kühn, Christoph Schommer


Abstract
Sentiment analysis (SA) plays a vital role in interpreting human opinions across different languages, especially in contexts like social media, product reviews, and other user-generated content. This study focuses on Luxembourgish, a low-resource language critical to Luxembourg’s identity, utilizing advanced deep learning models such as BERT, RoBERTa, LuxemBERTand LuxGPT-2. These models were enhanced with transfer learning, active learning strategies, and context-aware embeddings, enabling effective Luxembourgish processing. These models further improved with context-aware embeddings and were able to accurately detect sentiments, categorizing news comments into positive, negative, and neutral sentiments. Our approach highlights the significant role of human-in-the-loop (HITL) methodologies, which refine model accuracy by aligning automated analyses with human judgment. The findings indicate that LuxembBERT, especially when enhanced with the HITL method involving feedback from 500 and 1000 annotated sentences, outperforms other models in both binary (positive vs. negative) and multi-class (positive, neutral, and negative) classification tasks. The HITL approach not only refined model accuracy but also provided substantial improvements in understanding and processing sentiments and sarcasm, often challenging for automated systems. This study establishes the basis for future research to extend these methodologies to other underresourced languages, promising improvements in Natural Language Processing (NLP) applications across diverse linguistic landscapes.
Anthology ID:
2024.luhme-1.3
Volume:
Proceedings of the First LUHME Workshop
Month:
October
Year:
2024
Address:
Santiago de Compostela, Spain
Editors:
Rui Sousa-Silva, Henrique Lopes Cardoso, Maarit Koponen, Antonio Pareja Lora, Márta Seresi
Venues:
LUHME | WS
SIG:
Publisher:
CLUP, Centro de Linguística da Universidade do Porto FLUP - Faculdade de Letras da Universidade do Porto
Note:
Pages:
20–27
Language:
URL:
https://aclanthology.org/2024.luhme-1.3/
DOI:
Bibkey:
Cite (ACL):
Nina Hosseini-Kivanani, Julien Kühn, and Christoph Schommer. 2024. Mapping Sentiments: A Journey into Low-Resource Luxembourgish Analysis. In Proceedings of the First LUHME Workshop, pages 20–27, Santiago de Compostela, Spain. CLUP, Centro de Linguística da Universidade do Porto FLUP - Faculdade de Letras da Universidade do Porto.
Cite (Informal):
Mapping Sentiments: A Journey into Low-Resource Luxembourgish Analysis (Hosseini-Kivanani et al., LUHME 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.luhme-1.3.pdf