IRLAB-DAIICT@DravidianLangTech-EACL2021: Neural Machine Translation

Raj Prajapati, Vedant Vijay Parikh, Prasenjit Majumder


Abstract
This paper describes our team's submission to the EACL DravidianLangTech-2021 shared task on Machine Translation of Dravidian languages. We submitted translations from English to Malayalam, Tamil, and Telugu, as well as for the Tamil-Telugu language pair. The submissions focus on gathering an adequate amount of data and preprocessing it well, including custom rules to remove unnecessary sentences, in order to produce quality translations. We conducted several experiments on these models by tweaking the architecture, Byte Pair Encoding (BPE), and other hyperparameters.
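The abstract mentions custom rules for removing unnecessary sentences but does not list them. As a rough illustration only, the sketch below shows what rule-based filtering of a parallel corpus can look like. The function names (`keep_pair`, `filter_corpus`), the specific rules, and the thresholds (`max_len`, `max_ratio`) are all assumptions made for this example and are not taken from the paper.

```python
import unicodedata


def keep_pair(src: str, tgt: str, max_len: int = 100, max_ratio: float = 3.0) -> bool:
    """Return True if a sentence pair passes a few illustrative rules.

    The rules and thresholds are hypothetical, not the authors' own.
    """
    src_tokens = src.split()
    tgt_tokens = tgt.split()
    # Rule 1: drop pairs where either side is empty.
    if not src_tokens or not tgt_tokens:
        return False
    # Rule 2: drop pairs where either side is very long.
    if len(src_tokens) > max_len or len(tgt_tokens) > max_len:
        return False
    # Rule 3: drop pairs whose token counts differ too much.
    longer = max(len(src_tokens), len(tgt_tokens))
    shorter = min(len(src_tokens), len(tgt_tokens))
    if longer / shorter > max_ratio:
        return False
    # Rule 4: drop pairs where the target is a verbatim copy of the source.
    if src == tgt:
        return False
    return True


def filter_corpus(pairs):
    """Normalize, deduplicate, and filter a list of (source, target) pairs."""
    seen = set()
    kept = []
    for src, tgt in pairs:
        # NFC normalization keeps composed characters consistent across sources.
        src = unicodedata.normalize("NFC", src).strip()
        tgt = unicodedata.normalize("NFC", tgt).strip()
        if (src, tgt) in seen:
            continue  # skip exact duplicate pairs
        seen.add((src, tgt))
        if keep_pair(src, tgt):
            kept.append((src, tgt))
    return kept
```

Filtering of this kind would typically run before any subword segmentation such as BPE, so that the segmentation model is learned only from the sentences that are kept.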
Anthology ID:
2021.dravidianlangtech-1.36
Volume:
Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages
Month:
April
Year:
2021
Address:
Kyiv
Editors:
Bharathi Raja Chakravarthi, Ruba Priyadharshini, Anand Kumar M, Parameswari Krishnamurthy, Elizabeth Sherly
Venue:
DravidianLangTech
Publisher:
Association for Computational Linguistics
Pages:
262–265
URL:
https://aclanthology.org/2021.dravidianlangtech-1.36
Cite (ACL):
Raj Prajapati, Vedant Vijay Parikh, and Prasenjit Majumder. 2021. IRLAB-DAIICT@DravidianLangTech-EACL2021: Neural Machine Translation. In Proceedings of the First Workshop on Speech and Language Technologies for Dravidian Languages, pages 262–265, Kyiv. Association for Computational Linguistics.
Cite (Informal):
IRLAB-DAIICT@DravidianLangTech-EACL2021: Neural Machine Translation (Prajapati et al., DravidianLangTech 2021)
PDF:
https://aclanthology.org/2021.dravidianlangtech-1.36.pdf
Software:
 2021.dravidianlangtech-1.36.Software.zip