Improving the Transformer Translation Model with Document-Level Context

Jiacheng Zhang; Huanbo Luan; Maosong Sun (孙茂松); Feifei Zhai; Jingfang Xu; Min Zhang (张民); Yang Liu (刘洋)

doi:10.18653/v1/D18-1049

Improving the Transformer Translation Model with Document-Level Context

Jiacheng Zhang, Huanbo Luan, Maosong Sun, Feifei Zhai, Jingfang Xu, Min Zhang, Yang Liu

Abstract

Although the Transformer translation model (Vaswani et al., 2017) has achieved state-of-the-art performance in a variety of translation tasks, how to use document-level context to deal with discourse phenomena problematic for Transformer still remains a challenge. In this work, we extend the Transformer model with a new context encoder to represent document-level context, which is then incorporated into the original encoder and decoder. As large-scale document-level parallel corpora are usually not available, we introduce a two-step training method to take full advantage of abundant sentence-level parallel corpora and limited document-level parallel corpora. Experiments on the NIST Chinese-English datasets and the IWSLT French-English datasets show that our approach improves over Transformer significantly.

Anthology ID:: D18-1049
Volume:: Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
Month:: October-November
Year:: 2018
Address:: Brussels, Belgium
Editors:: Ellen Riloff, David Chiang, Julia Hockenmaier, Jun’ichi Tsujii
Venue:: EMNLP
SIG:: SIGDAT
Publisher:: Association for Computational Linguistics
Note:
Pages:: 533–542
Language:
URL:: https://aclanthology.org/D18-1049/
DOI:: 10.18653/v1/D18-1049
Bibkey:
Cite (ACL):: Jiacheng Zhang, Huanbo Luan, Maosong Sun, Feifei Zhai, Jingfang Xu, Min Zhang, and Yang Liu. 2018. Improving the Transformer Translation Model with Document-Level Context. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 533–542, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal):: Improving the Transformer Translation Model with Document-Level Context (Zhang et al., EMNLP 2018)
Copy Citation:
PDF:: https://aclanthology.org/D18-1049.pdf

PDF Cite Search Fix data