A Hybrid Multilingual Approach to Sentiment Analysis for Uralic and Low-Resource Languages: Combining Extractive and Abstractive Techniques

Mikhail Krasitskii, Olga Kolesnikova, Grigori Sidorov, Alexander Gelbukh


Abstract
This paper introduces a novel hybrid architecture for multilingual sentiment analysis specifically designed for morphologically complex Uralic languages. Our approach synergistically combines extractive and abstractive summarization with specialized morphological processing for agglutinative structures. The proposed model integrates dynamic thresholding mechanisms and culturally-aware attention layers, achieving statistically significant improvements of 12% accuracy for Uralic languages (p < 0.01) while outperforming state-of-the-art alternatives in summarization quality (ROUGE 1: 0.60 vs. 0.52). Key innovations include language-specific stemmers for Finno-Ugric languages and cross-Uralic transfer learning, yielding 15.7% improvement in recall while maintaining 98.2% precision. Comprehensive evaluations across multiple datasets demonstrate consistent superiority over contemporary baselines, with particular emphasis on addressing Uralic language processing challenges.
Anthology ID:
2025.iwclul-1.5
Volume:
Proceedings of the 10th International Workshop on Computational Linguistics for Uralic Languages
Month:
December
Year:
2025
Address:
Joensuu, Finland
Editors:
Mika Hämäläinen, Michael Rießler, Eiaki V. Morooka, Lev Kharlashkin
Venues:
IWCLUL | WS
SIG:
SIGUR
Publisher:
Association for Computational Linguistics
Note:
Pages:
29–38
Language:
URL:
https://aclanthology.org/2025.iwclul-1.5/
DOI:
Bibkey:
Cite (ACL):
Mikhail Krasitskii, Olga Kolesnikova, Grigori Sidorov, and Alexander Gelbukh. 2025. A Hybrid Multilingual Approach to Sentiment Analysis for Uralic and Low-Resource Languages: Combining Extractive and Abstractive Techniques. In Proceedings of the 10th International Workshop on Computational Linguistics for Uralic Languages, pages 29–38, Joensuu, Finland. Association for Computational Linguistics.
Cite (Informal):
A Hybrid Multilingual Approach to Sentiment Analysis for Uralic and Low-Resource Languages: Combining Extractive and Abstractive Techniques (Krasitskii et al., IWCLUL 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.iwclul-1.5.pdf