Larger feature set approach for machine translation in IWSLT 2007

Taro Watanabe, Jun Suzuki, Katsuhito Sudoh, Hajime Tsukada, Hideki Isozaki


Abstract
The NTT Statistical Machine Translation System employs a large number of feature functions. First, k-best translation candidates are generated by an efficient decoding method of hierarchical phrase-based translation. Second, the k-best translations are reranked. In both steps, sparse binary features — of the order of millions — are integrated during the search. This paper gives the details of the two steps and shows the results for the Evaluation campaign of the International Workshop on Spoken Language Translation (IWSLT) 2007.
Anthology ID:
2007.iwslt-1.16
Volume:
Proceedings of the Fourth International Workshop on Spoken Language Translation
Month:
October 15-16
Year:
2007
Address:
Trento, Italy
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
Note:
Pages:
Language:
URL:
https://aclanthology.org/2007.iwslt-1.16
DOI:
Bibkey:
Cite (ACL):
Taro Watanabe, Jun Suzuki, Katsuhito Sudoh, Hajime Tsukada, and Hideki Isozaki. 2007. Larger feature set approach for machine translation in IWSLT 2007. In Proceedings of the Fourth International Workshop on Spoken Language Translation, Trento, Italy.
Cite (Informal):
Larger feature set approach for machine translation in IWSLT 2007 (Watanabe et al., IWSLT 2007)
Copy Citation:
PDF:
https://aclanthology.org/2007.iwslt-1.16.pdf