Empirical methods for MT lexicon development

I. Dan Melamed


Abstract
This article reviews some recently invented methods for automatically extracting translation lexicons from parallel texts. The accuracy of these methods has been significantly improved by exploiting known properties of parallel texts and of particular language pairs. The state of the art has advanced to the point where non-compositional compounds can be automatically identified with high reliability, and their translations can be found. Most importantly, all of these methods can be smoothly integrated into the usual workflow of MT system developers. Semi-automatic MT lexicon construction is likely to be more efficient and more accurate than either fully automatic or fully manual methods alone.
Anthology ID:
1998.amta-papers.2
Volume:
Proceedings of the Third Conference of the Association for Machine Translation in the Americas: Technical Papers
Month:
October 28-31
Year:
1998
Address:
Langhorne, PA, USA
Editors:
David Farwell, Laurie Gerber, Eduard Hovy
Venue:
AMTA
Publisher:
Springer
Pages:
18–30
URL:
https://link.springer.com/chapter/10.1007/3-540-49478-2_2
Cite (ACL):
I. Dan Melamed. 1998. Empirical methods for MT lexicon development. In Proceedings of the Third Conference of the Association for Machine Translation in the Americas: Technical Papers, pages 18–30, Langhorne, PA, USA. Springer.
Cite (Informal):
Empirical methods for MT lexicon development (Melamed, AMTA 1998)
PDF:
https://link.springer.com/chapter/10.1007/3-540-49478-2_2