BelarusianGLUE: Towards a Natural Language Understanding Benchmark for Belarusian

Maksim Aparovich; Volha Harytskaya; Vladislav Poritski; Oksana Volchek; Pavel Smrz

doi:10.18653/v1/2025.acl-long.25

BelarusianGLUE: Towards a Natural Language Understanding Benchmark for Belarusian

Maksim Aparovich, Volha Harytskaya, Vladislav Poritski, Oksana Volchek, Pavel Smrz

Abstract

In the epoch of multilingual large language models (LLMs), it is still challenging to evaluate the models’ understanding of lower-resourced languages, which motivates further development of expert-crafted natural language understanding benchmarks. We introduce BelarusianGLUE — a natural language understanding benchmark for Belarusian, an East Slavic language, with ≈15K instances in five tasks: sentiment analysis, linguistic acceptability, word in context, Winograd schema challenge, textual entailment. A systematic evaluation of BERT models and LLMs against this novel benchmark reveals that both types of models approach human-level performance on easier tasks, such as sentiment analysis, but there is a significant gap in performance between machine and human on a harder task — Winograd schema challenge. We find the optimal choice of model type to be task-specific: e.g. BERT models underperform on textual entailment task but are competitive for linguistic acceptability. We release the datasets (https://hf.co/datasets/maaxap/BelarusianGLUE) and evaluation code (https://github.com/maaxap/BelarusianGLUE).

Anthology ID:: 2025.acl-long.25
Volume:: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 511–527
Language:
URL:: https://aclanthology.org/2025.acl-long.25/
DOI:: 10.18653/v1/2025.acl-long.25
Bibkey:
Cite (ACL):: Maksim Aparovich, Volha Harytskaya, Vladislav Poritski, Oksana Volchek, and Pavel Smrz. 2025. BelarusianGLUE: Towards a Natural Language Understanding Benchmark for Belarusian. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 511–527, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: BelarusianGLUE: Towards a Natural Language Understanding Benchmark for Belarusian (Aparovich et al., ACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.acl-long.25.pdf

PDF Cite Search Fix data