Artem Orlovskyi


2026

Dialectal speech remains largely underexplored in Automatic Speech Recognition (ASR) research, particularly for Slavic languages. While Ukrainian ASR systems have improved rapidly in recent years with the adoption of Whisper, XLS-R, and Wav2Vec-based models, their performance on dialectal variants remains poorly characterized and is often significantly degraded. In this work, we present the first dedicated effort to build ASR resources for the Hutsul dialect of Ukrainian. We develop a data preparation and segmentation pipeline, evaluate multiple forced alignment strategies, and benchmark state-of-the-art ASR models under zero-shot and fine-tuned conditions. Evaluating results with WER and CER, we demonstrate that large multilingual ASR models struggle with dialectal speech, while lightweight fine-tuning produces substantial improvements. All scripts, alignment tools, and training recipes are made publicly available to support future research on Ukrainian dialectal speech.