Jurgita Kapočiūtė-Dzikienė

Also published as: Jurgita Kapociute-Dzikiene


2025

pdf bib
Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States
Jurgita Kapočiūtė-Dzikienė | Toms Bergmanis | Mārcis Pinnis
Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025)

Although large language models (LLMs) have transformed our expectations of modern language technologies, concerns over data privacy often restrict the use of commercially available LLMs hosted outside of EU jurisdictions. This limits their application in governmental, defense, and other data-sensitive sectors. In this work, we evaluate the extent to which locally deployable open-weight large language models support lesser-spoken languages such as Lithuanian, Latvian, and Estonian. We examine various size and precision variants of the top-performing multilingual open-weight models, Llama 3, Gemma 2, Phi, and NeMo, on machine translation, multiple-choice question answering, and free-form text generation. The results indicate that while certain models like Gemma 2 perform close to the top commercially available models, many LLMs struggle with these languages. Most surprisingly, however, we find that these models, while showing close to state-of-the-art translation performance, are still prone to lexical hallucinations with errors in at least 1 in 20 words for all open-weight multilingual LLMs.

2015

pdf bib
The Effect of Author Set Size in Authorship Attribution for Lithuanian
Jurgita Kapočiūtė-Dzikienė | Ligita Šarkutė | Andrius Utka
Proceedings of the 20th Nordic Conference of Computational Linguistics (NODALIDA 2015)

pdf bib
Authorship Attribution and Author Profiling of Lithuanian Literary Texts
Jurgita Kapočiūtė-Dzikienė | Andrius Utka | Ligita Šarkutė
The 5th Workshop on Balto-Slavic Natural Language Processing

2013

pdf bib
A Comparison of Approaches for Sentiment Classification on Lithuanian Internet Comments
Jurgita Kapočiūtė-Dzikienė | Algis Krupavičius | Tomas Krilavičius
Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing

pdf bib
Lithuanian Dependency Parsing with Rich Morphological Features
Jurgita Kapočiūtė-Dzikienė | Joakim Nivre | Algis Krupavičius
Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically-Rich Languages

pdf bib
Exploring Features for Named Entity Recognition in Lithuanian Text Corpus
Jurgita Kapočiūtė-Dzikienė | Anders Nøklestad | Janne Bondi Johannessen | Algis Krupavičius
Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013)

2012

pdf bib
Improving Topic Classification for Highly Inflective Languages
Jurgita Kapociute-Dzikiene | Frederik Vaassen | Walter Daelemans | Algis Krupavičius
Proceedings of COLING 2012