Building Multiword Expressions Bilingual Lexicons for Domain Adaptation of an Example-Based Machine Translation System

Nasredine Semmar, Mariama Laib


Abstract
We describe in this paper a hybrid ap-proach to build automatically bilingual lexicons of Multiword Expressions (MWEs) from parallel corpora. We more specifically investigate the impact of using a domain-specific bilingual lexicon of MWEs on domain adaptation of an Example-Based Machine Translation (EBMT) system. We conducted experiments on the English-French language pair and two kinds of texts: in-domain texts from Europarl (European Parliament proceedings) and out-of-domain texts from Emea (European Medicines Agency documents) and Ecb (European Central Bank corpus). The obtained results indicate that integrating domain-specific bilingual lexicons of MWEs improves translation quality of the EBMT system when texts to translate are related to the specific domain and induces a relatively slight deterioration of translation quality when translating general-purpose texts.
Anthology ID:
R17-1085
Volume:
Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017
Month:
September
Year:
2017
Address:
Varna, Bulgaria
Editors:
Ruslan Mitkov, Galia Angelova
Venue:
RANLP
SIG:
Publisher:
INCOMA Ltd.
Note:
Pages:
661–670
Language:
URL:
https://doi.org/10.26615/978-954-452-049-6_085
DOI:
10.26615/978-954-452-049-6_085
Bibkey:
Cite (ACL):
Nasredine Semmar and Mariama Laib. 2017. Building Multiword Expressions Bilingual Lexicons for Domain Adaptation of an Example-Based Machine Translation System. In Proceedings of the International Conference Recent Advances in Natural Language Processing, RANLP 2017, pages 661–670, Varna, Bulgaria. INCOMA Ltd..
Cite (Informal):
Building Multiword Expressions Bilingual Lexicons for Domain Adaptation of an Example-Based Machine Translation System (Semmar & Laib, RANLP 2017)
Copy Citation:
PDF:
https://doi.org/10.26615/978-954-452-049-6_085