Pan Liu
2021
TenTrans Multilingual Low-Resource Translation System for WMT21 Indo-European Languages Task
Han Yang | Bojie Hu | Wanying Xie | Ambyera Han | Pan Liu | Jinan Xu | Qi Ju
Proceedings of the Sixth Conference on Machine Translation
This paper describes TenTrans’ submission to the WMT21 Multilingual Low-Resource Translation shared task for the Romance language pairs. The task focuses on improving translation quality from Catalan to Occitan, Romanian, and Italian with the assistance of related high-resource languages. We mainly use back-translation, pivot-based methods, multilingual models, pre-trained model fine-tuning, and in-domain knowledge transfer to improve translation quality. On the test set, our best submitted system achieves an average case-sensitive BLEU score of 43.45 across all low-resource pairs. The data, code, and pre-trained models used in this work are available in the TenTrans evaluation examples.