Translation Quality Estimation for Indian Languages

Nisarg Jhaveri, Manish Gupta, Vasudeva Varma


Abstract
Translation Quality Estimation (QE) aims to estimate the quality of an automated machine translation (MT) output without any human intervention or reference translation. With the increasing use of MT systems in various cross-lingual applications, the need and applicability of QE systems is increasing. We study existing approaches and propose multiple neural network approaches for sentence-level QE, with a focus on MT outputs in Indian languages. For this, we also introduce five new datasets for four language pairs: two for English–Gujarati, and one each for English–Hindi, English–Telugu and English–Bengali, which includes one manually post-edited dataset for English– Gujarati. These Indian languages are spoken by around 689M speakers world-wide. We compare results obtained using our proposed models with multiple state-of-the-art systems including the winning system in the WMT17 shared task on QE and show that our proposed neural model which combines the discriminative power of carefully chosen features with Siamese Convolutional Neural Networks (CNNs) works best for all Indian language datasets.
Anthology ID:
2018.eamt-main.16
Volume:
Proceedings of the 21st Annual Conference of the European Association for Machine Translation
Month:
May
Year:
2018
Address:
Alicante, Spain
Editors:
Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez, Miquel Esplà-Gomis, Maja Popović, Celia Rico, André Martins, Joachim Van den Bogaert, Mikel L. Forcada
Venue:
EAMT
SIG:
Publisher:
Note:
Pages:
179–188
Language:
URL:
https://aclanthology.org/2018.eamt-main.16
DOI:
Bibkey:
Cite (ACL):
Nisarg Jhaveri, Manish Gupta, and Vasudeva Varma. 2018. Translation Quality Estimation for Indian Languages. In Proceedings of the 21st Annual Conference of the European Association for Machine Translation, pages 179–188, Alicante, Spain.
Cite (Informal):
Translation Quality Estimation for Indian Languages (Jhaveri et al., EAMT 2018)
Copy Citation:
PDF:
https://aclanthology.org/2018.eamt-main.16.pdf