Better contextual translation using machine learning

Arul Menezes


Abstract
One of the problems facing translation systems that automatically extract transfer mappings (rules or examples) from bilingual corpora is the trade-off between contextual specificity and general applicability of the mappings, which typically results in conflicting mappings without distinguishing context. We present a machine-learning approach to choosing between such mappings, using classifiers that, in effect, selectively expand the context for these mappings using features available in a linguistic representation of the source language input. We show that using these classifiers in our machine translation system significantly improves the quality of the translated output. Additionally, the set of distinguishing features selected by the classifiers provides insight into the relative importance of the various linguistic features in choosing the correct contextual translation.
Anthology ID:
2002.amta-papers.13
Volume:
Proceedings of the 5th Conference of the Association for Machine Translation in the Americas: Technical Papers
Month:
October 8-12
Year:
2002
Address:
Tiburon, USA
Venue:
AMTA
Publisher:
Springer
Pages:
124–134
URL:
https://link.springer.com/chapter/10.1007/3-540-45820-4_13