Yurii Laba
2024
Ukrainian Visual Word Sense Disambiguation Benchmark
Yurii Laba
|
Yaryna Mohytych
|
Ivanna Rohulia
|
Halyna Kyryleyza
|
Hanna Dydyk-Meush
|
Oles Dobosevych
|
Rostyslav Hryniv
Proceedings of the Third Ukrainian Natural Language Processing Workshop (UNLP) @ LREC-COLING 2024
This study presents a benchmark for evaluating the Visual Word Sense Disambiguation (Visual-WSD) task in Ukrainian. The main goal of the Visual-WSD task is to identify, with minimal contextual information, the most appropriate representation of a given ambiguous word from a set of ten images. To construct this benchmark, we followed a methodology similar to that proposed by (CITATION), who previously introduced benchmarks for the Visual-WSD task in English, Italian, and Farsi. This approach allows us to incorporate the Ukrainian benchmark into a broader framework for cross-language model performance comparisons. We collected the benchmark data semi-automatically and refined it with input from domain experts. We then assessed eight multilingual and multimodal large language models using this benchmark. All tested models performed worse than the zero-shot CLIP-based baseline model (CITATION) used by (CITATION) for the English Visual-WSD task. Our analysis revealed a significant performance gap in the Visual-WSD task between Ukrainian and English.
2023
Contextual Embeddings for Ukrainian: A Large Language Model Approach to Word Sense Disambiguation
Yurii Laba
|
Volodymyr Mudryi
|
Dmytro Chaplynskyi
|
Mariana Romanyshyn
|
Oles Dobosevych
Proceedings of the Second Ukrainian Natural Language Processing Workshop (UNLP)
This research proposes a novel approach to the Word Sense Disambiguation (WSD) task in the Ukrainian language based on supervised fine-tuning of a pre-trained Large Language Model (LLM) on the dataset generated in an unsupervised way to obtain better contextual embeddings for words with multiple senses. The paper presents a method for generating a new dataset for WSD evaluation in the Ukrainian language based on the SUM dictionary. We developed a comprehensive framework that facilitates the generation of WSD evaluation datasets, enables the use of different prediction strategies, LLMs, and pooling strategies, and generates multiple performance reports. Our approach shows 77,9% accuracy for lexical meaning prediction for homonyms.
Search
Co-authors
- Oles Dobosevych 2
- Yaryna Mohytych 1
- Ivanna Rohulia 1
- Halyna Kyryleyza 1
- Hanna Dydyk-Meush 1
- show all...
Venues
- unlp2