CUNI-Bergamot Submission at WMT22 General Translation Task

Josef Jon, Martin Popel, Ondřej Bojar


Abstract
We present the CUNI-Bergamot submission for the WMT22 General translation task. We compete in English-Czech direction. Our submission further explores block backtranslation techniques. Compared to the previous work, we measure performance in terms of COMET score and named entities translation accuracy. We evaluate performance of MBR decoding compared to traditional mixed backtranslation training and we show a possible synergy when using both of the techniques simultaneously. The results show that both approaches are effective means of improving translation quality and they yield even better results when combined.
Anthology ID:
2022.wmt-1.21
Volume:
Proceedings of the Seventh Conference on Machine Translation (WMT)
Month:
December
Year:
2022
Address:
Abu Dhabi, United Arab Emirates (Hybrid)
Editors:
Philipp Koehn, Loïc Barrault, Ondřej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Tom Kocmi, André Martins, Makoto Morishita, Christof Monz, Masaaki Nagata, Toshiaki Nakazawa, Matteo Negri, Aurélie Névéol, Mariana Neves, Martin Popel, Marco Turchi, Marcos Zampieri
Venue:
WMT
SIG:
SIGMT
Publisher:
Association for Computational Linguistics
Note:
Pages:
280–289
Language:
URL:
https://aclanthology.org/2022.wmt-1.21
DOI:
Bibkey:
Cite (ACL):
Josef Jon, Martin Popel, and Ondřej Bojar. 2022. CUNI-Bergamot Submission at WMT22 General Translation Task. In Proceedings of the Seventh Conference on Machine Translation (WMT), pages 280–289, Abu Dhabi, United Arab Emirates (Hybrid). Association for Computational Linguistics.
Cite (Informal):
CUNI-Bergamot Submission at WMT22 General Translation Task (Jon et al., WMT 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.wmt-1.21.pdf