The Role of Compounds in Human vs. Machine Translation Quality

Kristyna Neumannova, Ondřej Bojar


Abstract
We focus on the production of German compounds in English-to-German manual and automatic translation. On the example of WMT21 news translation test set, we observe that even the best MT systems produce much fewer compounds compared to three independent manual translations. Despite this striking difference, we observe that this insufficiency is not apparent in manual evaluation methods that target the overall translation quality (DA and MQM). Simple automatic methods like BLEU somewhat surprisingly provide a better indication of this quality aspect. Our manual analysis of system outputs, including our freshly trained Transformer models, confirms that current deep neural systems operating at the level of subword units are capable of constructing novel words, including novel compounds. This effect however cannot be measured using static dictionaries of compounds such as GermaNet. German compounds thus pose an interesting challenge for future development of MT systems.
Anthology ID:
2023.mtsummit-research.21
Volume:
Proceedings of Machine Translation Summit XIX, Vol. 1: Research Track
Month:
September
Year:
2023
Address:
Macau SAR, China
Editors:
Masao Utiyama, Rui Wang
Venue:
MTSummit
SIG:
Publisher:
Asia-Pacific Association for Machine Translation
Note:
Pages:
248–260
Language:
URL:
https://aclanthology.org/2023.mtsummit-research.21
DOI:
Bibkey:
Cite (ACL):
Kristyna Neumannova and Ondřej Bojar. 2023. The Role of Compounds in Human vs. Machine Translation Quality. In Proceedings of Machine Translation Summit XIX, Vol. 1: Research Track, pages 248–260, Macau SAR, China. Asia-Pacific Association for Machine Translation.
Cite (Informal):
The Role of Compounds in Human vs. Machine Translation Quality (Neumannova & Bojar, MTSummit 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.mtsummit-research.21.pdf