UAReviews: A Multi-Task Ukrainian Dataset for Emotion and Intent Classification

Roman Kyslyi, Ihor Pysmennyi, Denys Mykhailov


Abstract
We introduce UAReviews, a multi-task Ukrainian-language dataset for emotion and intent classification comprising 11,580 annotated texts. The dataset combines two sources: citizen reviews of government digital services provided by the Ministry of Digital Transformation of Ukraine and Ukrainian-language Telegram posts drawn from the COSMUS corpus. Each text is annotated with both an emotion label following the Ekman taxonomy (seven classes) and an intent label (five classes), making it the first publicly available Ukrainian resource for joint emotion and intent analysis. Annotation was performed by students at the Anonymous Institution, with a gold standard subset (20\%) validated by three independent annotators achieving Krippendorff’s alpha = 0.93. We establish baselines using single-task and multi-task fine-tuned XLM-RoBERTa models and analyze emotion to intent correlation. Both the dataset and the baseline models are publicly available.
Anthology ID:
2026.unlp-1.2
Volume:
Proceedings of the Fifth Ukrainian Natural Language Processing Conference (UNLP 2026)
Month:
May
Year:
2026
Address:
Lviv, Ukraine
Editor:
Mariana Romanyshyn
Venue:
UNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
12–23
Language:
URL:
https://aclanthology.org/2026.unlp-1.2/
DOI:
Bibkey:
Cite (ACL):
Roman Kyslyi, Ihor Pysmennyi, and Denys Mykhailov. 2026. UAReviews: A Multi-Task Ukrainian Dataset for Emotion and Intent Classification. In Proceedings of the Fifth Ukrainian Natural Language Processing Conference (UNLP 2026), pages 12–23, Lviv, Ukraine. Association for Computational Linguistics.
Cite (Informal):
UAReviews: A Multi-Task Ukrainian Dataset for Emotion and Intent Classification (Kyslyi et al., UNLP 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.unlp-1.2.pdf