A Detailed Analysis of Phrase-based and Syntax-based MT: The Search for Systematic Differences

Rasoul Samad Zadeh Kaljahi, Raphael Rubino, Johann Roturier, Jennifer Foster


Abstract
This paper describes a range of automatic and manual comparisons of phrase-based and syntax-based statistical machine translation methods applied to English-German and English-French translation of user-generated content. The syntax-based methods underperform the phrase-based models and the relaxation of syntactic constraints to broaden translation rule coverage means that these models do not necessarily generate output which is more grammatical than the output produced by the phrase-based models. Although the systems generate different output and can potentially be fruitfully combined, the lack of systematic difference between these models makes the combination task more challenging.
Anthology ID:
2012.amta-papers.27
Volume:
Proceedings of the 10th Conference of the Association for Machine Translation in the Americas: Research Papers
Month:
October 28-November 1
Year:
2012
Address:
San Diego, California, USA
Venue:
AMTA
SIG:
Publisher:
Association for Machine Translation in the Americas
Note:
Pages:
Language:
URL:
https://aclanthology.org/2012.amta-papers.27
DOI:
Bibkey:
Copy Citation:
PDF:
https://aclanthology.org/2012.amta-papers.27.pdf