Open Source Statistical Machine Translation

Philipp Koehn, Hieu Hoang


Abstract
If you are interested in open-source machine translation but lack hands-on experience, this is the tutorial for you! We will start with background knowledge of statistical machine translation (SMT) and then walk you through the process of installing and running an SMT system. We will show you how to prepare input data and the most efficient way to train and use your translation systems. We will also discuss solutions to some of the most common issues that language service providers (LSPs) face when using SMT, including how to tailor systems to specific clients, preserve document layout and formatting, and incorporate new translation memories efficiently. Previous years’ participants have included software engineers and managers who need a detailed understanding of the SMT process. This is a fast-paced, hands-on tutorial that will cover the skills you need to get up and running with open-source SMT. The teaching will be based on the Moses toolkit, the most popular open-source machine translation software currently available. No prior knowledge of MT is necessary, only an interest in it. A laptop is required for this tutorial, and you should have a rudimentary knowledge of the command line on Windows or Linux.
Anthology ID:
2012.amta-tutorials.5
Volume:
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Tutorials
Month:
October 28-November 1
Year:
2012
Address:
San Diego, California, USA
Venue:
AMTA
Publisher:
Association for Machine Translation in the Americas
URL:
https://aclanthology.org/2012.amta-tutorials.5
Cite (ACL):
Philipp Koehn and Hieu Hoang. 2012. Open Source Statistical Machine Translation. In Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Tutorials, San Diego, California, USA. Association for Machine Translation in the Americas.
Cite (Informal):
Open Source Statistical Machine Translation (Koehn & Hoang, AMTA 2012)
PDF:
https://aclanthology.org/2012.amta-tutorials.5.pdf