An End-to-End Ukrainian RAG for Local Deployment. Optimized Hybrid Search and Lightweight Generation

Mykola Trokhymovych, Yana Oliinyk, Nazarii Nyzhnyk


Abstract
This paper presents a highly efficient Retrieval-Augmented Generation (RAG) system built specifically for Ukrainian document question answering, which achieved 2nd place in the UNLP 2026 Shared Task. Our solution features a custom two-stage search pipeline that retrieves relevant document pages, paired with a specialized Ukrainian language model fine-tuned on synthetic data to generate accurate, grounded answers. Finally, we compress the model for lightweight deployment. Evaluated under strict computational limits, our architecture demonstrates that high-quality, verifiable AI question answering can be achieved locally on resource-constrained hardware without sacrificing accuracy.
Anthology ID:
2026.unlp-1.19
Volume:
Proceedings of the Fifth Ukrainian Natural Language Processing Conference (UNLP 2026)
Month:
May
Year:
2026
Address:
Lviv, Ukraine
Editor:
Mariana Romanyshyn
Venue:
UNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
223–229
Language:
URL:
https://aclanthology.org/2026.unlp-1.19/
DOI:
Bibkey:
Cite (ACL):
Mykola Trokhymovych, Yana Oliinyk, and Nazarii Nyzhnyk. 2026. An End-to-End Ukrainian RAG for Local Deployment. Optimized Hybrid Search and Lightweight Generation. In Proceedings of the Fifth Ukrainian Natural Language Processing Conference (UNLP 2026), pages 223–229, Lviv, Ukraine. Association for Computational Linguistics.
Cite (Informal):
An End-to-End Ukrainian RAG for Local Deployment. Optimized Hybrid Search and Lightweight Generation (Trokhymovych et al., UNLP 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.unlp-1.19.pdf