Bartłomiej Boczek
2020
Samsung R&D Institute Poland submission to WMT20 News Translation Task
Mateusz Krubiński | Marcin Chochowski | Bartłomiej Boczek | Mikołaj Koszowski | Adam Dobrowolski | Marcin Szymański | Paweł Przybysz
Proceedings of the Fifth Conference on Machine Translation
This paper describes the submission to the WMT20 shared news translation task by Samsung R&D Institute Poland. We submitted systems for six language directions: English to Czech, Czech to English, English to Polish, Polish to English, English to Inuktitut and Inuktitut to English. For each, we trained a single-direction model. However, directions including English, Polish and Czech were derived from a common multilingual base, which was later fine-tuned on each particular direction. For all the translation directions, we used a similar training regime, with iterative training corpora improvement through back-translation and model ensembling. For the En → Cs direction, we additionally leveraged document-level information by re-ranking the beam output with a separate model.