A unified statistical model for generalized translation memory system

Jin-Xia Huang, Wei Wang, Ming Zhou


Abstract
We introduced, for Translation Memory System, a statistical framework, which unifies the different phases in a Translation Memory System by letting them constrain each other, and enables Translation Memory System a statistical qualification. Compared to traditional Translation Memory Systems, our model operates at a fine grained sub-sentential level such that it improves the translation coverage. Compared with other approaches that exploit sub-sentential benefits, it unifies the processes of source string segmentation, best example selection, and translation generation by making them constrain each other via the statistical confidence of each step. We realized this framework into a prototype system. Compared with an existing product Translation Memory System, our system exhibits obviously better performance in the "assistant quality metric" and gains improvements in the range of 26.3% to 55.1% in the "translation efficiency metric".
Anthology ID:
2003.mtsummit-papers.23
Volume:
Proceedings of Machine Translation Summit IX: Papers
Month:
September 23-27
Year:
2003
Address:
New Orleans, USA
Venue:
MTSummit
SIG:
Publisher:
Note:
Pages:
Language:
URL:
https://aclanthology.org/2003.mtsummit-papers.23
DOI:
Bibkey:
Cite (ACL):
Jin-Xia Huang, Wei Wang, and Ming Zhou. 2003. A unified statistical model for generalized translation memory system. In Proceedings of Machine Translation Summit IX: Papers, New Orleans, USA.
Cite (Informal):
A unified statistical model for generalized translation memory system (Huang et al., MTSummit 2003)
Copy Citation:
PDF:
https://aclanthology.org/2003.mtsummit-papers.23.pdf