TSU HITS’s Submissions to the WMT 2024 General Machine Translation Shared Task

Vladimir Mynka, Nikolay Mikhaylovskiy


Abstract
This paper describes the TSU HITS team’s submission to the WMT’24 general translation task. We explored the capabilities of discrete diffusion models for English-to-{Russian, German, Czech, Spanish} translation in the constrained track. Our submission consists of a set of discrete diffusion models for each language pair. The main advance is a separate length regression model that determines the length of the output sequence more precisely.
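The abstract does not say how the length regression model is built, so the following is only a minimal sketch of the general idea: predict the output length in a separate step before generation. It assumes a hypothetical linear least-squares fit from source token count to target token count. The function names `fit_length_regressor` and `predict_length`, the choice of a linear model, and the toy data are all illustrative assumptions, not the authors' method.

```python
def fit_length_regressor(src_lens, tgt_lens):
    """Least-squares fit of tgt_len ~ slope * src_len + intercept.

    Hypothetical stand-in for a length regression model; the real
    system may use a different model and different input features.
    """
    if len(src_lens) != len(tgt_lens) or not src_lens:
        raise ValueError("need two non-empty lists of equal length")
    n = len(src_lens)
    mean_x = sum(src_lens) / n
    mean_y = sum(tgt_lens) / n
    var_x = sum((x - mean_x) ** 2 for x in src_lens)
    if var_x == 0:
        raise ValueError("source lengths must not all be identical")
    cov_xy = sum((x - mean_x) * (y - mean_y)
                 for x, y in zip(src_lens, tgt_lens))
    slope = cov_xy / var_x
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict_length(src_len, slope, intercept, min_len=1):
    """Round the regression output to a usable token count (at least min_len)."""
    return max(min_len, round(slope * src_len + intercept))


if __name__ == "__main__":
    # Toy (source length, target length) pairs, invented for illustration.
    slope, intercept = fit_length_regressor([2, 4, 6, 8], [3, 5, 7, 9])
    print(predict_length(10, slope, intercept))
```

In a pipeline of this shape, the predicted length would be computed once per source sentence and then passed to the translation model as the size of the output sequence to generate.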
Anthology ID:
2024.wmt-1.13
Volume:
Proceedings of the Ninth Conference on Machine Translation
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Barry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz
Venue:
WMT
Publisher:
Association for Computational Linguistics
Pages:
205–209
URL:
https://aclanthology.org/2024.wmt-1.13
Cite (ACL):
Vladimir Mynka and Nikolay Mikhaylovskiy. 2024. TSU HITS’s Submissions to the WMT 2024 General Machine Translation Shared Task. In Proceedings of the Ninth Conference on Machine Translation, pages 205–209, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
TSU HITS’s Submissions to the WMT 2024 General Machine Translation Shared Task (Mynka & Mikhaylovskiy, WMT 2024)
PDF:
https://aclanthology.org/2024.wmt-1.13.pdf