Bolat Tleubayev


2022

Cyrillic-MNIST: a Cyrillic Version of the MNIST Dataset
Bolat Tleubayev | Zhanel Zhexenova | Kenessary Koishybay | Anara Sandygulova
Proceedings of the Thirteenth Language Resources and Evaluation Conference

This paper presents Cyrillic-MNIST, a new handwritten dataset and a Cyrillic version of the MNIST dataset, comprising 121,234 samples of 42 Cyrillic letters. Performance on Cyrillic-MNIST is evaluated using standard deep learning approaches and compared with that on the Extended MNIST (EMNIST) dataset. The dataset is available at https://github.com/bolattleubayev/cmnist