Incremental development of statistical machine translation systems

Li Gong, Aurélien Max, François Yvon


Abstract
Statistical Machine Translation produces results that make it a competitive option in most machine-assisted translation scenarios. However, these good results often come at a very high computational cost and correspond to training regimes which are unfit to many practical contexts, where the ability to adapt to users and domains and to continuously integrate new data (eg. in post-edition contexts) are of primary importance. In this article, we show how these requirements can be met using a strategy for on-demand word alignment and model estimation. Most remarkably, our incremental system development framework is shown to deliver top quality translation performance even in the absence of tuning, and to surpass a strong baseline when performing online tuning. All these results obtained with great computational savings as compared to conventional systems.
Anthology ID:
2014.iwslt-papers.9
Volume:
Proceedings of the 11th International Workshop on Spoken Language Translation: Papers
Month:
December 4-5
Year:
2014
Address:
Lake Tahoe, California
Editors:
Marcello Federico, Sebastian Stüker, François Yvon
Venue:
IWSLT
SIG:
SIGSLT
Publisher:
Note:
Pages:
214–222
Language:
URL:
https://aclanthology.org/2014.iwslt-papers.9
DOI:
Bibkey:
Cite (ACL):
Li Gong, Aurélien Max, and François Yvon. 2014. Incremental development of statistical machine translation systems. In Proceedings of the 11th International Workshop on Spoken Language Translation: Papers, pages 214–222, Lake Tahoe, California.
Cite (Informal):
Incremental development of statistical machine translation systems (Gong et al., IWSLT 2014)
Copy Citation:
PDF:
https://aclanthology.org/2014.iwslt-papers.9.pdf