An Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation

Chenhui Chu, Raj Dabre, Sadao Kurohashi


Abstract
In this paper, we propose a novel domain adaptation method named “mixed fine tuning” for neural machine translation (NMT). We combine two existing approaches namely fine tuning and multi domain NMT. We first train an NMT model on an out-of-domain parallel corpus, and then fine tune it on a parallel corpus which is a mix of the in-domain and out-of-domain corpora. All corpora are augmented with artificial tags to indicate specific domains. We empirically compare our proposed method against fine tuning and multi domain methods and discuss its benefits and shortcomings.
Anthology ID:
P17-2061
Volume:
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Month:
July
Year:
2017
Address:
Vancouver, Canada
Editors:
Regina Barzilay, Min-Yen Kan
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
385–391
Language:
URL:
https://aclanthology.org/P17-2061
DOI:
10.18653/v1/P17-2061
Bibkey:
Cite (ACL):
Chenhui Chu, Raj Dabre, and Sadao Kurohashi. 2017. An Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 385–391, Vancouver, Canada. Association for Computational Linguistics.
Cite (Informal):
An Empirical Comparison of Domain Adaptation Methods for Neural Machine Translation (Chu et al., ACL 2017)
Copy Citation:
PDF:
https://aclanthology.org/P17-2061.pdf
Poster:
 P17-2061.Poster.pdf
Data
ASPEC