HW-TSC 2023 Submission for the Quality Estimation Shared Task

Yuang Li; Chang Su; Ming Zhu; Mengyao Piao; Xinglin Lyu; Min Zhang (张民); Hao Yang (杨浩)

doi:10.18653/v1/2023.wmt-1.72

HW-TSC 2023 Submission for the Quality Estimation Shared Task

Yuang Li, Chang Su, Ming Zhu, Mengyao Piao, Xinglin Lyu, Min Zhang, Hao Yang

Abstract

Quality estimation (QE) is an essential technique to assess machine translation quality without reference translations. In this paper, we focus on Huawei Translation Services Center’s (HW-TSC’s) submission to the sentence-level QE shared task, named Ensemble-CrossQE. Our system uses CrossQE, the same model architecture as our last year’s submission, which consists of a multilingual base model and a task-specific downstream layer. The input is the concatenation of the source and the translated sentences. To enhance the performance, we finetuned and ensembled multiple base models such as XLM-R, InfoXLM, RemBERT and CometKiwi. Moreover, we introduce a new corruption-based data augmentation method, which generates deletion, substitution and insertion errors in the original translation and uses a reference-based QE model to obtain pseudo scores. Results show that our system achieves impressive performance on sentence-level QE test sets and ranked the first place for three language pairs: English-Hindi, English-Tamil and English-Telegu. In addition, we participated in the error span detection task. The submitted model outperforms the baseline on Chinese-English and Hebrew-English language pairs.

Anthology ID:: 2023.wmt-1.72
Volume:: Proceedings of the Eighth Conference on Machine Translation
Month:: December
Year:: 2023
Address:: Singapore
Editors:: Philipp Koehn, Barry Haddow, Tom Kocmi, Christof Monz
Venue:: WMT
SIG:: SIGMT
Publisher:: Association for Computational Linguistics
Note:
Pages:: 835–840
Language:
URL:: https://aclanthology.org/2023.wmt-1.72/
DOI:: 10.18653/v1/2023.wmt-1.72
Bibkey:
Cite (ACL):: Yuang Li, Chang Su, Ming Zhu, Mengyao Piao, Xinglin Lyu, Min Zhang, and Hao Yang. 2023. HW-TSC 2023 Submission for the Quality Estimation Shared Task. In Proceedings of the Eighth Conference on Machine Translation, pages 835–840, Singapore. Association for Computational Linguistics.
Cite (Informal):: HW-TSC 2023 Submission for the Quality Estimation Shared Task (Li et al., WMT 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.wmt-1.72.pdf

PDF Cite Search Fix data