LIMSI @ WMT 2020

Sadaf Abdul Rauf, José Carlos Rosales Núñez, Minh Quang Pham, François Yvon


Abstract
This paper describes LIMSI’s submissions to the translation shared tasks at WMT’20. This year we have focused our efforts on the biomedical translation task, developing a resource-heavy system for the translation of medical abstracts from English into French, using back-translated texts, terminological resources as well as multiple pre-processing pipelines, including pre-trained representations. Systems were also prepared for the robustness task for translating from English into German; for this large-scale task we developed multi-domain, noise-robust, translation systems aim to handle the two test conditions: zero-shot and few-shot domain adaptation.
Anthology ID:
2020.wmt-1.86
Volume:
Proceedings of the Fifth Conference on Machine Translation
Month:
November
Year:
2020
Address:
Online
Editors:
Loïc Barrault, Ondřej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Yvette Graham, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, André Martins, Makoto Morishita, Christof Monz, Masaaki Nagata, Toshiaki Nakazawa, Matteo Negri
Venue:
WMT
SIG:
SIGMT
Publisher:
Association for Computational Linguistics
Note:
Pages:
803–812
Language:
URL:
https://aclanthology.org/2020.wmt-1.86
DOI:
Bibkey:
Cite (ACL):
Sadaf Abdul Rauf, José Carlos Rosales Núñez, Minh Quang Pham, and François Yvon. 2020. LIMSI @ WMT 2020. In Proceedings of the Fifth Conference on Machine Translation, pages 803–812, Online. Association for Computational Linguistics.
Cite (Informal):
LIMSI @ WMT 2020 (Abdul Rauf et al., WMT 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.wmt-1.86.pdf
Video:
 https://slideslive.com/38939618