NICT-2 Translation System for WAT2016: Applying Domain Adaptation to Phrase-based Statistical Machine Translation

Kenji Imamura, Eiichiro Sumita


Abstract
This paper describes the NICT-2 translation system for the 3rd Workshop on Asian Translation. The proposed system employs a domain adaptation method based on feature augmentation. We regarded the Japan Patent Office Corpus as a mixture of four domain corpora and improved the translation quality of each domain. In addition, we incorporated language models constructed from Google n-grams as external knowledge. Our domain adaptation method can naturally incorporate such external knowledge that contributes to translation quality.
Anthology ID:
W16-4611
Volume:
Proceedings of the 3rd Workshop on Asian Translation (WAT2016)
Month:
December
Year:
2016
Address:
Osaka, Japan
Editors:
Toshiaki Nakazawa, Hideya Mino, Chenchen Ding, Isao Goto, Graham Neubig, Sadao Kurohashi, Ir. Hammam Riza, Pushpak Bhattacharyya
Venue:
WAT
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
126–132
Language:
URL:
https://aclanthology.org/W16-4611
DOI:
Bibkey:
Cite (ACL):
Kenji Imamura and Eiichiro Sumita. 2016. NICT-2 Translation System for WAT2016: Applying Domain Adaptation to Phrase-based Statistical Machine Translation. In Proceedings of the 3rd Workshop on Asian Translation (WAT2016), pages 126–132, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
NICT-2 Translation System for WAT2016: Applying Domain Adaptation to Phrase-based Statistical Machine Translation (Imamura & Sumita, WAT 2016)
Copy Citation:
PDF:
https://aclanthology.org/W16-4611.pdf
Data
ASPEC