POSTECH machine translation system for IWSLT 2008 evaluation campaign.

Jonghoon Lee, Gary Geunbae Lee


Abstract
In this paper, we describe POSTECH system for IWSLT 2008 evaluation campaign. The system is based on phrase based statistical machine translation. We set up a baseline system using well known freely available software. A preprocessing method and a language modeling method have been applied to the baseline system in order to improve machine translation quality. The preprocessing method is to identify and remove useless tokens in source texts. And the language modeling method models phrase level n-gram. We have participated in the BTEC tasks to see the effects of our methods.
Anthology ID:
2008.iwslt-evaluation.14
Volume:
Proceedings of the 5th International Workshop on Spoken Language Translation: Evaluation Campaign
Month:
October 20-21
Year:
2008
Address:
Waikiki, Hawaii
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
Note:
Pages:
98–103
Language:
URL:
https://aclanthology.org/2008.iwslt-evaluation.14
DOI:
Bibkey:
Cite (ACL):
Jonghoon Lee and Gary Geunbae Lee. 2008. POSTECH machine translation system for IWSLT 2008 evaluation campaign.. In Proceedings of the 5th International Workshop on Spoken Language Translation: Evaluation Campaign, pages 98–103, Waikiki, Hawaii.
Cite (Informal):
POSTECH machine translation system for IWSLT 2008 evaluation campaign. (Lee & Lee, IWSLT 2008)
Copy Citation:
PDF:
https://aclanthology.org/2008.iwslt-evaluation.14.pdf