Implementing a neural machine translation engine for mobile devices: the Lingvanex use case

Zuzanna Parcheta, Germán Sanchis-Trilles, Aliaksei Rudak, Siarhei Bratchenia


Abstract
In this paper, we present the challenge entailed by implementing a mobile version of a neural machine translation system, where the goal is to maximise translation quality while minimising model size. We explain the whole process of implementing the translation engine on an English–Spanish example and we describe all the difficulties found and the solutions implemented. The main techniques used in this work are data selection by means of Infrequent n-gram Recovery, appending a special word at the end of each sentence, and generating additional samples without the final punctuation marks. The last two techniques were devised with the purpose of achieving a translation model that generates sentences without the final full stop, or other punctuation marks. Also, in this work, the Infrequent n-gram Recovery was used for the first time to create a new corpus, and not enlarge the in-domain dataset. Finally, we get a small size model with quality good enough to serve for daily use.
Anthology ID:
2018.eamt-main.31
Volume:
Proceedings of the 21st Annual Conference of the European Association for Machine Translation
Month:
May
Year:
2018
Address:
Alicante, Spain
Editors:
Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Miquel Esplà-Gomis, Maja Popović, Celia Rico, André Martins, Joachim Van den Bogaert, Mikel L. Forcada
Venue:
EAMT
SIG:
Publisher:
Note:
Pages:
317–322
Language:
URL:
https://aclanthology.org/2018.eamt-main.31
DOI:
Bibkey:
Cite (ACL):
Zuzanna Parcheta, Germán Sanchis-Trilles, Aliaksei Rudak, and Siarhei Bratchenia. 2018. Implementing a neural machine translation engine for mobile devices: the Lingvanex use case. In Proceedings of the 21st Annual Conference of the European Association for Machine Translation, pages 317–322, Alicante, Spain.
Cite (Informal):
Implementing a neural machine translation engine for mobile devices: the Lingvanex use case (Parcheta et al., EAMT 2018)
Copy Citation:
PDF:
https://aclanthology.org/2018.eamt-main.31.pdf