Isai Gordeev
2024
FLORES+ Translation and Machine Translation Evaluation for the Erzya Language
Isai Gordeev
|
Sergey Kuldin
|
David Dale
Proceedings of the Ninth Conference on Machine Translation
This paper introduces a translation of the FLORES+ dataset into the endangered Erzya language, with the goal of evaluating machine translation between this language and any of the other 200 languages already included into FLORES+. This translation was carried out as a part of the Open Language Data shared task at WMT24. We also present a benchmark of existing translation models bases on this dataset and a new translation model that achieves the state-of-the-art quality of translation into Erzya from Russian and English.