Using Contextual Information for Machine Translation Evaluation

Marina Fomicheva; Núria Bel

Using Contextual Information for Machine Translation Evaluation

Abstract

Automatic evaluation of Machine Translation (MT) is typically approached by measuring similarity between the candidate MT and a human reference translation. An important limitation of existing evaluation systems is that they are unable to distinguish candidate-reference differences that arise due to acceptable linguistic variation from the differences induced by MT errors. In this paper we present a new metric, UPF-Cobalt, that addresses this issue by taking into consideration the syntactic contexts of candidate and reference words. The metric applies a penalty when the words are similar but the contexts in which they occur are not equivalent. In this way, Machine Translations (MTs) that are different from the human translation but still essentially correct are distinguished from those that share high number of words with the reference but alter the meaning of the sentence due to translation errors. The results show that the method proposed is indeed beneficial for automatic MT evaluation. We report experiments based on two different evaluation tasks with various types of manual quality assessment. The metric significantly outperforms state-of-the-art evaluation systems in varying evaluation settings.

Anthology ID:: L16-1437
Volume:: Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)
Month:: May
Year:: 2016
Address:: Portorož, Slovenia
Editors:: Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Helene Mazo, Asuncion Moreno, Jan Odijk, Stelios Piperidis
Venue:: LREC
SIG:
Publisher:: European Language Resources Association (ELRA)
Note:
Pages:: 2755–2761
Language:
URL:: https://aclanthology.org/L16-1437/
DOI:
Bibkey:
Cite (ACL):: Marina Fomicheva and Núria Bel. 2016. Using Contextual Information for Machine Translation Evaluation. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 2755–2761, Portorož, Slovenia. European Language Resources Association (ELRA).
Cite (Informal):: Using Contextual Information for Machine Translation Evaluation (Fomicheva & Bel, LREC 2016)
Copy Citation:
PDF:: https://aclanthology.org/L16-1437.pdf

PDF Cite Search Fix data