Hindi-Marathi Cross Lingual Model

Sahinur Rahman Laskar; Abdullah Faiz Ur Rahman Khilji; Partha Pakray; Sivaji Bandyopadhyay

Hindi-Marathi Cross Lingual Model

Sahinur Rahman Laskar, Abdullah Faiz Ur Rahman Khilji, Partha Pakray, Sivaji Bandyopadhyay

Abstract

Machine Translation (MT) is a vital tool for aiding communication between linguistically separate groups of people. The neural machine translation (NMT) based approaches have gained widespread acceptance because of its outstanding performance. We have participated in WMT20 shared task of similar language translation on Hindi-Marathi pair. The main challenge of this task is by utilization of monolingual data and similarity features of similar language pair to overcome the limitation of available parallel data. In this work, we have implemented NMT based model that simultaneously learns bilingual embedding from both the source and target language pairs. Our model has achieved Hindi to Marathi bilingual evaluation understudy (BLEU) score of 11.59, rank-based intuitive bilingual evaluation score (RIBES) score of 57.76 and translation edit rate (TER) score of 79.07 and Marathi to Hindi BLEU score of 15.44, RIBES score of 61.13 and TER score of 75.96.

Anthology ID:: 2020.wmt-1.45
Volume:: Proceedings of the Fifth Conference on Machine Translation
Month:: November
Year:: 2020
Address:: Online
Editors:: Loïc Barrault, Ondřej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Yvette Graham, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, André Martins, Makoto Morishita, Christof Monz, Masaaki Nagata, Toshiaki Nakazawa, Matteo Negri
Venue:: WMT
SIG:: SIGMT
Publisher:: Association for Computational Linguistics
Note:
Pages:: 396–401
Language:
URL:: https://aclanthology.org/2020.wmt-1.45
DOI:
Bibkey:
Cite (ACL):: Sahinur Rahman Laskar, Abdullah Faiz Ur Rahman Khilji, Partha Pakray, and Sivaji Bandyopadhyay. 2020. Hindi-Marathi Cross Lingual Model. In Proceedings of the Fifth Conference on Machine Translation, pages 396–401, Online. Association for Computational Linguistics.
Cite (Informal):: Hindi-Marathi Cross Lingual Model (Laskar et al., WMT 2020)
Copy Citation:
PDF:: https://aclanthology.org/2020.wmt-1.45.pdf
Video:: https://slideslive.com/38939611

PDF Cite Search Video