Convergence of Translation Memory and Statistical Machine Translation

Philipp Koehn, Jean Senellart


Abstract
We present two methods that merge ideas from statistical machine translation (SMT) and translation memories (TM). We use a TM to retrieve matches for source segments, and replace the mismatched parts with instructions to an SMT system to fill in the gap. We show that for fuzzy matches of over 70%, one method outperforms both SMT and TM baselines.
Anthology ID:
2010.jec-1.4
Volume:
Proceedings of the Second Joint EM+/CNGL Workshop: Bringing MT to the User: Research on Integrating MT in the Translation Industry
Month:
November 4
Year:
2010
Address:
Denver, Colorado, USA
Venues:
AMTA | JEC
SIG:
Publisher:
Association for Machine Translation in the Americas
Note:
Pages:
21–32
Language:
URL:
https://aclanthology.org/2010.jec-1.4
DOI:
Bibkey:
Copy Citation:
PDF:
https://aclanthology.org/2010.jec-1.4.pdf