Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts

Eleftheria Briakou, Jiaming Luo, Colin Cherry, Markus Freitag


Abstract
In this paper we present a step-by-step approach to long-form text translation, drawing on established processes in translation studies. Instead of viewing machine translation as a single, monolithic task, we propose a framework that engages language models in a multi-turn interaction, encompassing pre-translation research, drafting, refining, and proofreading, resulting in progressively improved translations.Extensive automatic evaluations using Gemini 1.5 Pro across ten language pairs show that translating step-by-step yields large translation quality improvements over conventional zero-shot prompting approaches and earlier human-like baseline strategies, resulting in state-of-the-art results on WMT 2024.
Anthology ID:
2024.wmt-1.123
Volume:
Proceedings of the Ninth Conference on Machine Translation
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Barry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz
Venue:
WMT
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1301–1317
Language:
URL:
https://aclanthology.org/2024.wmt-1.123
DOI:
Bibkey:
Cite (ACL):
Eleftheria Briakou, Jiaming Luo, Colin Cherry, and Markus Freitag. 2024. Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts. In Proceedings of the Ninth Conference on Machine Translation, pages 1301–1317, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts (Briakou et al., WMT 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.wmt-1.123.pdf