Improving Zero-shot Translation with Language-Independent Constraints

Ngoc-Quan Pham; Jan Niehues; Thanh-Le Ha; Alex Waibel

doi:10.18653/v1/W19-5202

Improving Zero-shot Translation with Language-Independent Constraints

Ngoc-Quan Pham, Jan Niehues, Thanh-Le Ha, Alexander Waibel

Abstract

An important concern in training multilingual neural machine translation (NMT) is to translate between language pairs unseen during training, i.e zero-shot translation. Improving this ability kills two birds with one stone by providing an alternative to pivot translation which also allows us to better understand how the model captures information between languages. In this work, we carried out an investigation on this capability of the multilingual NMT models. First, we intentionally create an encoder architecture which is independent with respect to the source language. Such experiments shed light on the ability of NMT encoders to learn multilingual representations, in general. Based on such proof of concept, we were able to design regularization methods into the standard Transformer model, so that the whole architecture becomes more robust in zero-shot conditions. We investigated the behaviour of such models on the standard IWSLT 2017 multilingual dataset. We achieved an average improvement of 2.23 BLEU points across 12 language pairs compared to the zero-shot performance of a state-of-the-art multilingual system. Additionally, we carry out further experiments in which the effect is confirmed even for language pairs with multiple intermediate pivots.

Anthology ID:: W19-5202
Volume:: Proceedings of the Fourth Conference on Machine Translation (Volume 1: Research Papers)
Month:: August
Year:: 2019
Address:: Florence, Italy
Editors:: Ondřej Bojar, Rajen Chatterjee, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, André Martins, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana Neves, Matt Post, Marco Turchi, Karin Verspoor
Venue:: WMT
SIG:: SIGMT
Publisher:: Association for Computational Linguistics
Note:
Pages:: 13–23
Language:
URL:: https://aclanthology.org/W19-5202/
DOI:: 10.18653/v1/W19-5202
Bibkey:
Cite (ACL):: Ngoc-Quan Pham, Jan Niehues, Thanh-Le Ha, and Alexander Waibel. 2019. Improving Zero-shot Translation with Language-Independent Constraints. In Proceedings of the Fourth Conference on Machine Translation (Volume 1: Research Papers), pages 13–23, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):: Improving Zero-shot Translation with Language-Independent Constraints (Pham et al., WMT 2019)
Copy Citation:
PDF:: https://aclanthology.org/W19-5202.pdf

PDF Cite Search Fix data