The NICT/ATR speech translation system for IWSLT 2008.

Masao Utiyama, Andrew Finch, Hideo Okuma, Michael Paul, Hailong Cao, Hirofumi Yamamoto, Keiji Yasuda, Eiichiro Sumita


Abstract
This paper describes the National Institute of Information and Communications Technology/Advanced Telecommunications Research Institute International (NICT/ATR) statistical machine translation (SMT) system used for the IWSLT 2008 evaluation campaign. We participated in the Chinese–English (Challenge Task), English–Chinese (Challenge Task), Chinese–English (BTEC Task), Chinese–Spanish (BTEC Task), and Chinese–English–Spanish (PIVOT Task) translation tasks. In the English–Chinese translation Challenge Task, we focused on exploring various factors for the English–Chinese translation because the research on the translation of English–Chinese is scarce compared to the opposite direction. In the Chinese–English translation Challenge Task, we employed a novel clustering method, where training sentences similar to the development data in terms of the word error rate formed a cluster. In the pivot translation task, we integrated two strategies for pivot translation by linear interpolation.
Anthology ID:
2008.iwslt-evaluation.11
Volume:
Proceedings of the 5th International Workshop on Spoken Language Translation: Evaluation Campaign
Month:
October 20-21
Year:
2008
Address:
Waikiki, Hawaii
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
Note:
Pages:
77–84
Language:
URL:
https://aclanthology.org/2008.iwslt-evaluation.11
DOI:
Bibkey:
Cite (ACL):
Masao Utiyama, Andrew Finch, Hideo Okuma, Michael Paul, Hailong Cao, Hirofumi Yamamoto, Keiji Yasuda, and Eiichiro Sumita. 2008. The NICT/ATR speech translation system for IWSLT 2008.. In Proceedings of the 5th International Workshop on Spoken Language Translation: Evaluation Campaign, pages 77–84, Waikiki, Hawaii.
Cite (Informal):
The NICT/ATR speech translation system for IWSLT 2008. (Utiyama et al., IWSLT 2008)
Copy Citation:
PDF:
https://aclanthology.org/2008.iwslt-evaluation.11.pdf