A multi-source approach for Breton–French hybrid machine translation

Víctor M. Sánchez-Cartagena, Mikel L. Forcada, Felipe Sánchez-Martínez


Abstract
Corpus-based approaches to machine translation (MT) have difficulties when the amount of parallel corpora to use for training is scarce, especially if the languages involved in the translation are highly inflected. This problem can be addressed from different perspectives, including data augmentation, transfer learning, and the use of additional resources, such as those used in rule-based MT. This paper focuses on the hybridisation of rule-based MT and neural MT for the Breton–French under-resourced language pair in an attempt to study to what extent the rule-based MT resources help improve the translation quality of the neural MT system for this particular under-resourced language pair. We combine both translation approaches in a multi-source neural MT architecture and find out that, even though the rule-based system has a low performance according to automatic evaluation metrics, using it leads to improved translation quality.
Anthology ID:
2020.eamt-1.8
Volume:
Proceedings of the 22nd Annual Conference of the European Association for Machine Translation
Month:
November
Year:
2020
Address:
Lisboa, Portugal
Editors:
André Martins, Helena Moniz, Sara Fumega, Bruno Martins, Fernando Batista, Luisa Coheur, Carla Parra, Isabel Trancoso, Marco Turchi, Arianna Bisazza, Joss Moorkens, Ana Guerberof, Mary Nurminen, Lena Marg, Mikel L. Forcada
Venue:
EAMT
SIG:
Publisher:
European Association for Machine Translation
Note:
Pages:
61–70
Language:
URL:
https://aclanthology.org/2020.eamt-1.8
DOI:
Bibkey:
Cite (ACL):
Víctor M. Sánchez-Cartagena, Mikel L. Forcada, and Felipe Sánchez-Martínez. 2020. A multi-source approach for Breton–French hybrid machine translation. In Proceedings of the 22nd Annual Conference of the European Association for Machine Translation, pages 61–70, Lisboa, Portugal. European Association for Machine Translation.
Cite (Informal):
A multi-source approach for Breton–French hybrid machine translation (Sánchez-Cartagena et al., EAMT 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.eamt-1.8.pdf