Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool

Medet Mukushev, Aigerim Kydyrbekova, Vadim Kimmelman, Anara Sandygulova


Abstract
This paper presents a new dataset for Kazakh-Russian Sign Language (KRSL) created for the purposes of Sign Language Processing. In 2020, Kazakhstan’s schools were quickly switched to online mode due to the COVID-19 pandemic. Every working day, the El-arna TV channel was broadcasting video lessons for grades from 1 to 11 with sign language translation. This opportunity allowed us to record a corpus with a large vocabulary and spontaneous SL interpretation. To this end, this corpus contains video recordings of Kazakhstan’s online school translated to Kazakh-Russian sign language by 7 interpreters. At the moment we collected and cleaned 890 hours of video material. A custom annotation tool was created to make the process of data annotation simple and easy-to-use by the Deaf community. To date, around 325 hours of videos have been annotated with glosses and 4,009 lessons out of 4,547 were transcribed with automatic speech-to-text software. The KRSL-OnlineSchool dataset will be made publicly available at https://krslproject.github.io/online-school/
Anthology ID:
2022.signlang-1.24
Volume:
Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Eleni Efthimiou, Stavroula-Evita Fotinea, Thomas Hanke, Julie A. Hochgesang, Jette Kristoffersen, Johanna Mesch, Marc Schulder
Venue:
SignLang
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
154–158
Language:
URL:
https://aclanthology.org/2022.signlang-1.24
DOI:
Bibkey:
Cite (ACL):
Medet Mukushev, Aigerim Kydyrbekova, Vadim Kimmelman, and Anara Sandygulova. 2022. Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool. In Proceedings of the LREC2022 10th Workshop on the Representation and Processing of Sign Languages: Multilingual Sign Language Resources, pages 154–158, Marseille, France. European Language Resources Association.
Cite (Informal):
Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool (Mukushev et al., SignLang 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.signlang-1.24.pdf
Data
How2Sign