IIT Bombay’s English-Indonesian submission at WAT: Integrating Neural Language Models with SMT

Sandhya Singh, Anoop Kunchukuttan, Pushpak Bhattacharyya


Abstract
This paper describes the IIT Bombay’s submission as a part of the shared task in WAT 2016 for English–Indonesian language pair. The results reported here are for both the direction of the language pair. Among the various approaches experimented, Operation Sequence Model (OSM) and Neural Language Model have been submitted for WAT. The OSM approach integrates translation and reordering process resulting in relatively improved translation. Similarly the neural experiment integrates Neural Language Model with Statistical Machine Translation (SMT) as a feature for translation. The Neural Probabilistic Language Model (NPLM) gave relatively high BLEU points for Indonesian to English translation system while the Neural Network Joint Model (NNJM) performed better for English to Indonesian direction of translation system. The results indicate improvement over the baseline Phrase-based SMT by 0.61 BLEU points for English-Indonesian system and 0.55 BLEU points for Indonesian-English translation system.
Anthology ID:
W16-4604
Volume:
Proceedings of the 3rd Workshop on Asian Translation (WAT2016)
Month:
December
Year:
2016
Address:
Osaka, Japan
Venues:
WAT | WS
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
68–74
Language:
URL:
https://aclanthology.org/W16-4604
DOI:
Bibkey:
Cite (ACL):
Sandhya Singh, Anoop Kunchukuttan, and Pushpak Bhattacharyya. 2016. IIT Bombay’s English-Indonesian submission at WAT: Integrating Neural Language Models with SMT. In Proceedings of the 3rd Workshop on Asian Translation (WAT2016), pages 68–74, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
IIT Bombay’s English-Indonesian submission at WAT: Integrating Neural Language Models with SMT (Singh et al., 2016)
Copy Citation:
PDF:
https://aclanthology.org/W16-4604.pdf