Scenarios for Customizing an SMT Engine Based on Availability of Data

Kirti Vashee, Rustin Gibbs


Abstract
Although still in a nascent state as a professional translation tool, customized SMT engines already have multiple applications, each of which require clear definitions about quality and productivity. Three engine-training scenarios have emerged which are representative of real-world applications for the development and use of a customized SMT engines based on the availability of data. In the case that limited or no bilingual training data is available, a unique development process can be used to harvest and translate n-grams directly. Using this approach Asia Online and Moravia IT have successfully customized SMT engines for use in various domains. A partnership between an MT engine provider and a qualified LSP is essential to deliver quality results using this approach.
Anthology ID:
2010.amta-commercial.7
Volume:
Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Commercial MT User Program
Month:
October 31-November 4
Year:
2010
Address:
Denver, Colorado, USA
Venue:
AMTA
SIG:
Publisher:
Association for Machine Translation in the Americas
Note:
Pages:
Language:
URL:
https://aclanthology.org/2010.amta-commercial.7
DOI:
Bibkey:
Cite (ACL):
Kirti Vashee and Rustin Gibbs. 2010. Scenarios for Customizing an SMT Engine Based on Availability of Data. In Proceedings of the 9th Conference of the Association for Machine Translation in the Americas: Commercial MT User Program, Denver, Colorado, USA. Association for Machine Translation in the Americas.
Cite (Informal):
Scenarios for Customizing an SMT Engine Based on Availability of Data (Vashee & Gibbs, AMTA 2010)
Copy Citation:
PDF:
https://aclanthology.org/2010.amta-commercial.7.pdf