Using Spoken Word Posterior Features in Neural Machine Translation

Kaho Osamura; Takatomo Kano; Sakriani Sakti; Katsuhito Sudoh; Satoshi Nakamura

Using Spoken Word Posterior Features in Neural Machine Translation

Kaho Osamura, Takatomo Kano, Sakriani Sakti, Katsuhito Sudoh, Satoshi Nakamura

Abstract

A spoken language translation (ST) system consists of at least two modules: an automatic speech recognition (ASR) system and a machine translation (MT) system. In most cases, an MT is only trained and optimized using error-free text data. If the ASR makes errors, the translation accuracy will be greatly reduced. Existing studies have shown that training MT systems with ASR parameters or word lattices can improve the translation quality. However, such an extension requires a large change in standard MT systems, resulting in a complicated model that is hard to train. In this paper, a neural sequence-to-sequence ASR is used as feature processing that is trained to produce word posterior features given spoken utterances. The resulting probabilistic features are used to train a neural MT (NMT) with only a slight modification. Experimental results reveal that the proposed method improved up to 5.8 BLEU scores with synthesized speech or 4.3 BLEU scores with the natural speech in comparison with a conventional cascaded-based ST system that translates from the 1-best ASR candidates.

Anthology ID:: 2018.iwslt-1.28
Volume:: Proceedings of the 15th International Conference on Spoken Language Translation
Month:: October 29-30
Year:: 2018
Address:: Brussels
Editors:: Marco Turchi, Jan Niehues, Marcello Frederico
Venue:: IWSLT
SIG:: SIGSLT
Publisher:: International Conference on Spoken Language Translation
Note:
Pages:: 189–195
Language:
URL:: https://aclanthology.org/2018.iwslt-1.28/
DOI:
Bibkey:
Cite (ACL):: Kaho Osamura, Takatomo Kano, Sakriani Sakti, Katsuhito Sudoh, and Satoshi Nakamura. 2018. Using Spoken Word Posterior Features in Neural Machine Translation. In Proceedings of the 15th International Conference on Spoken Language Translation, pages 189–195, Brussels. International Conference on Spoken Language Translation.
Cite (Informal):: Using Spoken Word Posterior Features in Neural Machine Translation (Osamura et al., IWSLT 2018)
Copy Citation:
PDF:: https://aclanthology.org/2018.iwslt-1.28.pdf

PDF Cite Search Fix data