The UNLP 2026 Shared Task on Multi-Domain Document Understanding

Volodymyr Sydorskyi, Nataliia Romanyshyn, Roman Kyslyi, Olena Nahorna


Abstract
This paper presents the results of the UNLP 2026 Shared Task on Multi-Domain Document Understanding. This Shared Task aims to challenge and assess AI capabilities to find the right information in a stack of domain-specific documents and generalize across domains. Participants were required not only to select the correct answer, but also to localize it by predicting the corresponding document and page. A total of 54 teams registered for the competition, 15 teams submitted systems, and 513 runs were evaluated on a hidden test set via Kaggle in a code-only submission format under constrained computational resources. The Kaggle leaderboard is left open for further submissions. Summarizing the contributions of this work, we establish a Ukrainian multi-domain document understanding benchmark, which consists of: (1) a collected dataset; (2) a proposed evaluation metric; and (3) an analysis of top-performing systems evaluated under a unified framework.
Anthology ID:
2026.unlp-1.22
Volume:
Proceedings of the Fifth Ukrainian Natural Language Processing Conference (UNLP 2026)
Month:
May
Year:
2026
Address:
Lviv, Ukraine
Editor:
Mariana Romanyshyn
Venue:
UNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
249–259
Language:
URL:
https://aclanthology.org/2026.unlp-1.22/
DOI:
Bibkey:
Cite (ACL):
Volodymyr Sydorskyi, Nataliia Romanyshyn, Roman Kyslyi, and Olena Nahorna. 2026. The UNLP 2026 Shared Task on Multi-Domain Document Understanding. In Proceedings of the Fifth Ukrainian Natural Language Processing Conference (UNLP 2026), pages 249–259, Lviv, Ukraine. Association for Computational Linguistics.
Cite (Informal):
The UNLP 2026 Shared Task on Multi-Domain Document Understanding (Sydorskyi et al., UNLP 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.unlp-1.22.pdf