Evaluating Large Language Models on Lithuanian Grammatical Cases

Urtė Jakubauskaitė; Raquel G. Alhama

Evaluating Large Language Models on Lithuanian Grammatical Cases

Abstract

We present a systematic evaluation of large language models (LLMs) on Lithuanian grammatical case marking, a task that has received little prior attention. Lithuanian is a relatively low-resource language, with rich morphology and explicit marking. To enable fine-grained syntactic and morphological assessment, we introduce a novel dataset of 305 minimal sentence pairs contrasting correct and incorrect case usage. Our results show that case marking is challenging for current models, with overall accuracy ranging from 0.662 to 0.852. A monolingual Lithuanian LLM consistently outperforms multilingual counterparts, highlighting the value of language-specific training over model size. Performance varies across cases: genitive and locative forms are generally better handled, while rarer constructions and subtle functional distinctions remain difficult. The dataset and analysis provide a resource for future work, supporting the development of more robust LLMs and targeted evaluation benchmarks for morphologically rich, low-resource languages.

Anthology ID:: 2026.loreslm-1.32
Volume:: Proceedings of the Second Workshop on Language Models for Low-Resource Languages (LoResLM 2026)
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Hansi Hettiarachchi, Tharindu Ranasinghe, Alistair Plum, Paul Rayson, Ruslan Mitkov, Mohamed Gaber, Damith Premasiri, Fiona Anting Tan, Lasitha Uyangodage
Venue:: LoResLM
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 371–377
Language:
URL:: https://aclanthology.org/2026.loreslm-1.32/
DOI:
Bibkey:
Cite (ACL):: Urtė Jakubauskaitė and Raquel G. Alhama. 2026. Evaluating Large Language Models on Lithuanian Grammatical Cases. In Proceedings of the Second Workshop on Language Models for Low-Resource Languages (LoResLM 2026), pages 371–377, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: Evaluating Large Language Models on Lithuanian Grammatical Cases (Jakubauskaitė & Alhama, LoResLM 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.loreslm-1.32.pdf

PDF Cite Search Fix data