WAT2019: English-Hindi Translation on Hindi Visual Genome Dataset

Loitongbam Sanayai Meetei, Thoudam Doren Singh, Sivaji Bandyopadhyay


Abstract
Multimodal translation is the task of translating from a source language to a target language with the help of a parallel text corpus paired with images that represent the contextual details of the text. In this paper, we carry out an extensive comparison to evaluate the benefits of a multimodal approach for translating English text into Hindi, a low-resource language, as part of the WAT2019 shared task. We performed English-to-Hindi translation in three separate tasks on both the evaluation and challenge datasets: first using only the parallel text corpora, then through an image caption generation approach, and finally with the multimodal approach. Our experiments show a significant improvement with the multimodal approach over the other approaches.
Anthology ID:
D19-5224
Volume:
Proceedings of the 6th Workshop on Asian Translation
Month:
November
Year:
2019
Address:
Hong Kong, China
Editors:
Toshiaki Nakazawa, Chenchen Ding, Raj Dabre, Anoop Kunchukuttan, Nobushige Doi, Yusuke Oda, Ondřej Bojar, Shantipriya Parida, Isao Goto, Hidaya Mino
Venue:
WAT
Publisher:
Association for Computational Linguistics
Pages:
181–188
URL:
https://aclanthology.org/D19-5224
DOI:
10.18653/v1/D19-5224
Cite (ACL):
Loitongbam Sanayai Meetei, Thoudam Doren Singh, and Sivaji Bandyopadhyay. 2019. WAT2019: English-Hindi Translation on Hindi Visual Genome Dataset. In Proceedings of the 6th Workshop on Asian Translation, pages 181–188, Hong Kong, China. Association for Computational Linguistics.
Cite (Informal):
WAT2019: English-Hindi Translation on Hindi Visual Genome Dataset (Sanayai Meetei et al., WAT 2019)
PDF:
https://aclanthology.org/D19-5224.pdf