Linguistic Evaluation for the 2021 State-of-the-art Machine Translation Systems for German to English and English to German

Vivien Macketanz, Eleftherios Avramidis, Shushen Manakhimova, Sebastian Möller


Abstract
We use a semi-automated test suite to provide a fine-grained linguistic evaluation of state-of-the-art machine translation systems. The evaluation covers 18 German to English and 18 English to German systems submitted to the Translation Shared Task of the 2021 Conference on Machine Translation. Our submission extends those of previous years by creating and applying a wide-ranging test suite for English to German as a new language pair. The fine-grained evaluation makes it possible to spot significant differences between systems that cannot be distinguished by the direct assessment of the human evaluation campaign. We find that most of the systems achieve good accuracy on the majority of linguistic phenomena, but a few phenomena show lower accuracy, such as idioms, the modal pluperfect, and German resultative predicates. In each language direction, two systems achieve significantly better macro-averaged test suite accuracy: Online-W and Facebook-AI for German to English, and VolcTrans and Online-W for English to German. The systems show steady improvement over previous years.
Anthology ID:
2021.wmt-1.115
Volume:
Proceedings of the Sixth Conference on Machine Translation
Month:
November
Year:
2021
Address:
Online
Editors:
Loic Barrault, Ondrej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussa, Christian Federmann, Mark Fishel, Alexander Fraser, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno Yepes, Philipp Koehn, Tom Kocmi, Andre Martins, Makoto Morishita, Christof Monz
Venue:
WMT
SIG:
SIGMT
Publisher:
Association for Computational Linguistics
Pages:
1059–1073
URL:
https://aclanthology.org/2021.wmt-1.115
Cite (ACL):
Vivien Macketanz, Eleftherios Avramidis, Shushen Manakhimova, and Sebastian Möller. 2021. Linguistic Evaluation for the 2021 State-of-the-art Machine Translation Systems for German to English and English to German. In Proceedings of the Sixth Conference on Machine Translation, pages 1059–1073, Online. Association for Computational Linguistics.
Cite (Informal):
Linguistic Evaluation for the 2021 State-of-the-art Machine Translation Systems for German to English and English to German (Macketanz et al., WMT 2021)
PDF:
https://aclanthology.org/2021.wmt-1.115.pdf