Test-time Adaptation for Machine Translation Evaluation by Uncertainty Minimization

Runzhe Zhan; Xuebo Liu; Derek F. Wong; Cuilian Zhang; Lidia S. Chao; Min Zhang

doi:10.18653/v1/2023.acl-long.47

Test-time Adaptation for Machine Translation Evaluation by Uncertainty Minimization

Runzhe Zhan, Xuebo Liu, Derek F. Wong, Cuilian Zhang, Lidia S. Chao, Min Zhang

Abstract

The neural metrics recently received considerable attention from the research community in the automatic evaluation of machine translation. Unlike text-based metrics that have interpretable and consistent evaluation mechanisms for various data sources, the reliability of neural metrics in assessing out-of-distribution data remains a concern due to the disparity between training data and real-world data. This paper aims to address the inference bias of neural metrics through uncertainty minimization during test time, without requiring additional data. Our proposed method comprises three steps: uncertainty estimation, test-time adaptation, and inference. Specifically, the model employs the prediction uncertainty of the current data as a signal to update a small fraction of parameters during test time and subsequently refine the prediction through optimization. To validate our approach, we apply the proposed method to three representative models and conduct experiments on the WMT21 benchmarks. The results obtained from both in-domain and out-of-distribution evaluations consistently demonstrate improvements in correlation performance across different models. Furthermore, we provide evidence that the proposed method effectively reduces model uncertainty. The code is publicly available at https://github.com/NLP2CT/TaU.

Anthology ID:: 2023.acl-long.47
Volume:: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2023
Address:: Toronto, Canada
Editors:: Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 807–820
Language:
URL:: https://aclanthology.org/2023.acl-long.47
DOI:: 10.18653/v1/2023.acl-long.47
Bibkey:
Cite (ACL):: Runzhe Zhan, Xuebo Liu, Derek F. Wong, Cuilian Zhang, Lidia S. Chao, and Min Zhang. 2023. Test-time Adaptation for Machine Translation Evaluation by Uncertainty Minimization. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 807–820, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):: Test-time Adaptation for Machine Translation Evaluation by Uncertainty Minimization (Zhan et al., ACL 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.acl-long.47.pdf
Video:: https://aclanthology.org/2023.acl-long.47.mp4

PDF Cite Search Video