The speech recognition and machine translation system of IOIT for IWSLT 2013

Ngoc-Quan Pham, Hai-Son Le, Tat-Thang Vu, Chi-Mai Luong


Abstract
This paper describes the Automatic Speech Recognition (ASR) and Machine Translation (MT) systems developed by IOIT for the evaluation campaign of IWSLT2013. For the ASR task, using Kaldi toolkit, we developed the system based on weighted finite state transducer. The system is constructed by applying several techniques, notably, subspace Gaussian mixture models, speaker adaptation, discriminative training, system combination and SOUL, a neural network language model. The techniques used for automatic segmentation are also clarified. Besides, we compared different types of SOUL models in order to study the impact of words of previous sentences in predicting words in language modeling. For the MT task, the baseline system was built based on the open source toolkit N-code, then being augmented by using SOUL on top, i.e., in N-best rescoring phase.
Anthology ID:
2013.iwslt-evaluation.18
Volume:
Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign
Month:
December 5-6
Year:
2013
Address:
Heidelberg, Germany
Editor:
Joy Ying Zhang
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
Note:
Pages:
Language:
URL:
https://aclanthology.org/2013.iwslt-evaluation.18
DOI:
Bibkey:
Cite (ACL):
Ngoc-Quan Pham, Hai-Son Le, Tat-Thang Vu, and Chi-Mai Luong. 2013. The speech recognition and machine translation system of IOIT for IWSLT 2013. In Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign, Heidelberg, Germany.
Cite (Informal):
The speech recognition and machine translation system of IOIT for IWSLT 2013 (Pham et al., IWSLT 2013)
Copy Citation:
PDF:
https://aclanthology.org/2013.iwslt-evaluation.18.pdf