Machine Translation for Low Resource Turkic Languages: English-Tatar

Alexander Dikov


Abstract
This paper outlines our winning submission to the English-to-Tatar translation task. We evaluated three strategies: few-shot prompting with Gemini 3 Pro Preview, specialized trans-tokenized Tweeties models, and the RL-distilled TranslateGemma family. Results demonstrate that large commercial models significantly outperform smaller specialized ones in this low-resource setting. Gemini secured first place with a chrF++ score of 56.71, surpassing the open-source baseline of 25.23.
Anthology ID:
2026.loresmt-1.21
Volume:
Proceedings for the Ninth Workshop on Technologies for Machine Translation of Low Resource Languages (LoResMT 2026)
Month:
March
Year:
2026
Address:
Rabat, Morocco
Editors:
Atul Kr. Ojha, Chao-hong Liu, Ekaterina Vylomova, Flammie Pirinen, Jonathan Washington, Nathaniel Oco, Xiaobing Zhao
Venues:
LoResMT | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
222–224
Language:
URL:
https://aclanthology.org/2026.loresmt-1.21/
DOI:
Bibkey:
Cite (ACL):
Alexander Dikov. 2026. Machine Translation for Low Resource Turkic Languages: English-Tatar. In Proceedings for the Ninth Workshop on Technologies for Machine Translation of Low Resource Languages (LoResMT 2026), pages 222–224, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
Machine Translation for Low Resource Turkic Languages: English-Tatar (Dikov, LoResMT 2026)
Copy Citation:
PDF:
https://aclanthology.org/2026.loresmt-1.21.pdf