Jurgita Kapočiūtė-Dzikienė
Also published as: Jurgita Kapociute-Dzikiene
2025
Localizing AI: Evaluating Open-Weight Language Models for Languages of Baltic States
Jurgita Kapočiūtė-Dzikienė | Toms Bergmanis | Mārcis Pinnis
Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025)
Jurgita Kapočiūtė-Dzikienė | Toms Bergmanis | Mārcis Pinnis
Proceedings of the Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies (NoDaLiDa/Baltic-HLT 2025)
Although large language models (LLMs) have transformed our expectations of modern language technologies, concerns over data privacy often restrict the use of commercially available LLMs hosted outside of EU jurisdictions. This limits their application in governmental, defense, and other data-sensitive sectors. In this work, we evaluate the extent to which locally deployable open-weight large language models support lesser-spoken languages such as Lithuanian, Latvian, and Estonian. We examine various size and precision variants of the top-performing multilingual open-weight models, Llama 3, Gemma 2, Phi, and NeMo, on machine translation, multiple-choice question answering, and free-form text generation. The results indicate that while certain models like Gemma 2 perform close to the top commercially available models, many LLMs struggle with these languages. Most surprisingly, however, we find that these models, while showing close to state-of-the-art translation performance, are still prone to lexical hallucinations with errors in at least 1 in 20 words for all open-weight multilingual LLMs.
2015
Authorship Attribution and Author Profiling of Lithuanian Literary Texts
Jurgita Kapočiūtė-Dzikienė | Andrius Utka | Ligita Šarkutė
The 5th Workshop on Balto-Slavic Natural Language Processing
Jurgita Kapočiūtė-Dzikienė | Andrius Utka | Ligita Šarkutė
The 5th Workshop on Balto-Slavic Natural Language Processing
The Effect of Author Set Size in Authorship Attribution for Lithuanian
Jurgita Kapočiūtė-Dzikienė | Ligita Šarkutė | Andrius Utka
Proceedings of the 20th Nordic Conference of Computational Linguistics (NODALIDA 2015)
Jurgita Kapočiūtė-Dzikienė | Ligita Šarkutė | Andrius Utka
Proceedings of the 20th Nordic Conference of Computational Linguistics (NODALIDA 2015)
2013
Exploring Features for Named Entity Recognition in Lithuanian Text Corpus
Jurgita Kapočiūtė-Dzikienė | Anders Nøklestad | Janne Bondi Johannessen | Algis Krupavičius
Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013)
Jurgita Kapočiūtė-Dzikienė | Anders Nøklestad | Janne Bondi Johannessen | Algis Krupavičius
Proceedings of the 19th Nordic Conference of Computational Linguistics (NODALIDA 2013)
A Comparison of Approaches for Sentiment Classification on Lithuanian Internet Comments
Jurgita Kapočiūtė-Dzikienė | Algis Krupavičius | Tomas Krilavičius
Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing
Jurgita Kapočiūtė-Dzikienė | Algis Krupavičius | Tomas Krilavičius
Proceedings of the 4th Biennial International Workshop on Balto-Slavic Natural Language Processing
Lithuanian Dependency Parsing with Rich Morphological Features
Jurgita Kapočiūtė-Dzikienė | Joakim Nivre | Algis Krupavičius
Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically-Rich Languages
Jurgita Kapočiūtė-Dzikienė | Joakim Nivre | Algis Krupavičius
Proceedings of the Fourth Workshop on Statistical Parsing of Morphologically-Rich Languages