System Description of BV-SLP for Sindhi-English Machine Translation in MultiIndic22MT 2024 Shared Task

Nisheeth Joshi, Pragya Katyayan, Palak Arora, Bharti Nathani


Abstract
This paper presents the machine translation system that we developed for the WAT2024 MultiIndic MT shared task. We built our system for the Sindhi-English language pair and developed two MT systems. The first was our baseline system, in which Sindhi was translated into English. In the second system, we used Hindi as a pivot language for translation. In both cases, we identified the named entities and translated them into English as a preprocessing step. Once this was done, the standard NMT process was followed to train the models and generate MT outputs for the task. The systems were tested on the hidden dataset of the shared task.
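The two configurations described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the dictionary-based named-entity lexicon, and the toy token-level "translators" are all hypothetical stand-ins for trained NMT models, and the assumption that named-entity replacement runs once, before the first translation step, is inferred from the abstract's wording only.

```python
from typing import Callable, Dict

Translator = Callable[[str], str]


def replace_named_entities(text: str, ne_lexicon: Dict[str, str]) -> str:
    """Replace each known source-side named entity with its English form.

    Hypothetical preprocessing step: a real system would use an NER model
    plus transliteration or translation rather than a fixed lexicon.
    """
    # Longest entries first so multi-word names win over their substrings.
    for source_form in sorted(ne_lexicon, key=len, reverse=True):
        text = text.replace(source_form, ne_lexicon[source_form])
    return text


def direct_system(text: str, ne_lexicon: Dict[str, str],
                  sd_to_en: Translator) -> str:
    """Baseline: Sindhi -> English after named-entity preprocessing."""
    return sd_to_en(replace_named_entities(text, ne_lexicon))


def pivot_system(text: str, ne_lexicon: Dict[str, str],
                 sd_to_hi: Translator, hi_to_en: Translator) -> str:
    """Pivot: Sindhi -> Hindi -> English after named-entity preprocessing."""
    preprocessed = replace_named_entities(text, ne_lexicon)
    return hi_to_en(sd_to_hi(preprocessed))


def make_toy_translator(table: Dict[str, str]) -> Translator:
    """Build a token-by-token lookup 'translator' (placeholder for NMT)."""
    def translate(text: str) -> str:
        # Unknown tokens (e.g. already-English named entities) pass through.
        return " ".join(table.get(token, token) for token in text.split())
    return translate


# Placeholder tokens stand in for real Sindhi and Hindi words.
NE_LEXICON = {"karachi_sd": "Karachi"}
SD_TO_EN = make_toy_translator({"shahar_sd": "city"})
SD_TO_HI = make_toy_translator({"shahar_sd": "shahar_hi"})
HI_TO_EN = make_toy_translator({"shahar_hi": "city"})

if __name__ == "__main__":
    sentence = "karachi_sd shahar_sd"
    print(direct_system(sentence, NE_LEXICON, SD_TO_EN))
    print(pivot_system(sentence, NE_LEXICON, SD_TO_HI, HI_TO_EN))
```

In this sketch the pivot system is simply the composition of two translators, so any error introduced in the Sindhi-Hindi step is carried into the Hindi-English step; translating named entities up front keeps them out of both steps.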
Anthology ID:
2024.wmt-1.72
Volume:
Proceedings of the Ninth Conference on Machine Translation
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Barry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz
Venue:
WMT
Publisher:
Association for Computational Linguistics
Pages:
793–796
URL:
https://aclanthology.org/2024.wmt-1.72
Cite (ACL):
Nisheeth Joshi, Pragya Katyayan, Palak Arora, and Bharti Nathani. 2024. System Description of BV-SLP for Sindhi-English Machine Translation in MultiIndic22MT 2024 Shared Task. In Proceedings of the Ninth Conference on Machine Translation, pages 793–796, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
System Description of BV-SLP for Sindhi-English Machine Translation in MultiIndic22MT 2024 Shared Task (Joshi et al., WMT 2024)
PDF:
https://aclanthology.org/2024.wmt-1.72.pdf