I2R Chinese-English translation system for IWSLT 2007

Boxing Chen, Jun Sun, Hongfei Jiang, Min Zhang, Ai Ti Aw


Abstract
In this paper, we describe the system and approach used by the Institute for Infocomm Research (I2R) for the IWSLT 2007 spoken language evaluation campaign. A multi-pass approach is exploited to generate and select the best translation. First, we use two decoders, namely the open-source Moses and an in-house syntax-based decoder, to generate N-best lists. Next, we spawn new translation entries through a word-based n-gram language model estimated on those N-best entries. Finally, we join the N-best lists from the previous two passes and select the best translation by rescoring them with additional feature functions. In particular, this paper reports our effort on new translation entry generation and system combination. The performance on the development and test sets is reported. The system was ranked first with respect to the BLEU measure in the Chinese-to-English open data track.
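The second and third passes can be illustrated with a small sketch. It is not the authors' implementation: the abstract does not state the n-gram order, the generation procedure, or the rescoring features, so the bigram order, the exhaustive enumeration, the `max_len` cutoff, and the function names `bigram_counts`, `regenerate`, and `rescore` are all assumptions made for illustration. The sketch only shows the general idea that n-grams observed in an N-best list can license word sequences that neither decoder produced, and that the joined list can then be rescored with a weighted sum of feature functions.

```python
from collections import defaultdict


def bigram_counts(nbest):
    """Count bigrams over whitespace-tokenized hypotheses (assumed bigram order)."""
    counts = defaultdict(lambda: defaultdict(int))
    for hyp in nbest:
        tokens = ["<s>"] + hyp.split() + ["</s>"]
        for prev, cur in zip(tokens, tokens[1:]):
            counts[prev][cur] += 1
    return counts


def regenerate(nbest, max_len=20):
    """Enumerate sentences built only from observed bigrams, minus the originals.

    Hypothetical procedure: a real system would score and prune candidates
    with n-gram probabilities instead of enumerating them exhaustively.
    """
    counts = bigram_counts(nbest)
    seen = set(nbest)
    new_entries = set()
    stack = [("<s>",)]
    while stack:
        path = stack.pop()
        for nxt in counts[path[-1]]:
            if nxt == "</s>":
                sent = " ".join(path[1:])
                if sent and sent not in seen:
                    new_entries.add(sent)
            elif len(path) <= max_len:  # cutoff guards against bigram cycles
                stack.append(path + (nxt,))
    return sorted(new_entries)


def rescore(candidates, feature_fns, weights):
    """Return the candidate with the highest weighted feature sum."""
    def score(cand):
        return sum(w * f(cand) for f, w in zip(feature_fns, weights))
    return max(candidates, key=score)


# Toy N-best list standing in for the joined output of two decoders.
nbest = ["i want a ticket", "i need a room"]
spawned = regenerate(nbest)

# Placeholder feature functions, invented purely for this example.
features = [
    lambda c: 1.0 if "want" in c else 0.0,
    lambda c: 1.0 if "room" in c else 0.0,
]
best = rescore(nbest + spawned, features, [1.0, 1.0])
```

In this toy case the spawned entries recombine the two hypotheses at their shared word, and rescoring selects one of the spawned entries rather than an original decoder output. Exhaustive enumeration grows quickly on realistic N-best lists, which is why the sketch is only suitable for small inputs.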
Anthology ID:
2007.iwslt-1.8
Volume:
Proceedings of the Fourth International Workshop on Spoken Language Translation
Month:
October 15-16
Year:
2007
Address:
Trento, Italy
Venue:
IWSLT
SIG:
SIGSLT
URL:
https://aclanthology.org/2007.iwslt-1.8
Cite (ACL):
Boxing Chen, Jun Sun, Hongfei Jiang, Min Zhang, and Ai Ti Aw. 2007. I2R Chinese-English translation system for IWSLT 2007. In Proceedings of the Fourth International Workshop on Spoken Language Translation, Trento, Italy.
Cite (Informal):
I2R Chinese-English translation system for IWSLT 2007 (Chen et al., IWSLT 2007)
PDF:
https://aclanthology.org/2007.iwslt-1.8.pdf