Sergey Kuldin


2024

pdf bib
FLORES+ Translation and Machine Translation Evaluation for the Erzya Language
Isai Gordeev | Sergey Kuldin | David Dale
Proceedings of the Ninth Conference on Machine Translation

This paper introduces a translation of the FLORES+ dataset into the endangered Erzya language, with the goal of evaluating machine translation between this language and any of the other 200 languages already included into FLORES+. This translation was carried out as a part of the Open Language Data shared task at WMT24. We also present a benchmark of existing translation models bases on this dataset and a new translation model that achieves the state-of-the-art quality of translation into Erzya from Russian and English.
Search
Venues