UkrSL: Towards a Ukrainian Continuous Sign Language Dataset

Oleksandr Sobetskyi, Maryna Kosse, Roman Kyslyi, Angelina Savchenko


Abstract
We present UkrSL, an annotated dataset for Ukrainian Sign Language (USL), one of the most under-resourced sign languages in Europe. The dataset comprises 1,456 annotated clips (1,463 with cropped video segments) totalling approximately two hours of signing, sourced from six broadcast videos from Suspilne, Ukraine’s public broadcaster. Each clip is annotated with a spoken Ukrainian transcription aligned to the corresponding signing segment. We describe the data collection pipeline and the annotation methodology, and we provide a detailed analysis of the dataset’s statistics and limitations. The dataset is being actively expanded, and we release this snapshot to support the research community and invite collaboration.
Anthology ID:
2026.unlp-1.6
Volume:
Proceedings of the Fifth Ukrainian Natural Language Processing Conference (UNLP 2026)
Month:
May
Year:
2026
Address:
Lviv, Ukraine
Editor:
Mariana Romanyshyn
Venue:
UNLP
Publisher:
Association for Computational Linguistics
Pages:
53–57
URL:
https://aclanthology.org/2026.unlp-1.6/
Cite (ACL):
Oleksandr Sobetskyi, Maryna Kosse, Roman Kyslyi, and Angelina Savchenko. 2026. UkrSL: Towards a Ukrainian Continuous Sign Language Dataset. In Proceedings of the Fifth Ukrainian Natural Language Processing Conference (UNLP 2026), pages 53–57, Lviv, Ukraine. Association for Computational Linguistics.
Cite (Informal):
UkrSL: Towards a Ukrainian Continuous Sign Language Dataset (Sobetskyi et al., UNLP 2026)
PDF:
https://aclanthology.org/2026.unlp-1.6.pdf