Unbabel’s Participation in the WMT19 Translation Quality Estimation Shared Task

Fabio Kepler; Jonay Trénous; Marcos Treviso; Miguel Vera; António Góis; M. Amin Farajian; António V. Lopes; André F. T. Martins

doi:10.18653/v1/W19-5406

Unbabel’s Participation in the WMT19 Translation Quality Estimation Shared Task

Fabio Kepler, Jonay Trénous, Marcos Treviso, Miguel Vera, António Góis, M. Amin Farajian, António V. Lopes, André F. T. Martins

Abstract

We present the contribution of the Unbabel team to the WMT 2019 Shared Task on Quality Estimation. We participated on the word, sentence, and document-level tracks, encompassing 3 language pairs: English-German, English-Russian, and English-French. Our submissions build upon the recent OpenKiwi framework: We combine linear, neural, and predictor-estimator systems with new transfer learning approaches using BERT and XLM pre-trained models. We compare systems individually and propose new ensemble techniques for word and sentence-level predictions. We also propose a simple technique for converting word labels into document-level predictions. Overall, our submitted systems achieve the best results on all tracks and language pairs by a considerable margin.

Anthology ID:: W19-5406
Volume:: Proceedings of the Fourth Conference on Machine Translation (Volume 3: Shared Task Papers, Day 2)
Month:: August
Year:: 2019
Address:: Florence, Italy
Editors:: Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, André Martins, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana Neves, Matt Post, Marco Turchi, Karin Verspoor
Venue:: WMT
SIG:: SIGMT
Publisher:: Association for Computational Linguistics
Note:
Pages:: 78–84
Language:
URL:: https://aclanthology.org/W19-5406/
DOI:: 10.18653/v1/W19-5406
Bibkey:
Cite (ACL):: Fabio Kepler, Jonay Trénous, Marcos Treviso, Miguel Vera, António Góis, M. Amin Farajian, António V. Lopes, and André F. T. Martins. 2019. Unbabel’s Participation in the WMT19 Translation Quality Estimation Shared Task. In Proceedings of the Fourth Conference on Machine Translation (Volume 3: Shared Task Papers, Day 2), pages 78–84, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):: Unbabel’s Participation in the WMT19 Translation Quality Estimation Shared Task (Kepler et al., WMT 2019)
Copy Citation:
PDF:: https://aclanthology.org/W19-5406.pdf

PDF Cite Search Fix data