Findings of the WMT 2024 Shared Task on Non-Repetitive Translation

Kazutaka Kinugawa, Hideya Mino, Isao Goto, Naoto Shirai


Abstract
The repetition of words in an English sentence can create a monotonous or awkward impression. In such cases, repetition should be avoided appropriately. To evaluate the performance of machine translation (MT) systems in avoiding such repetition and outputting more polished translations, we presented the shared task of controlling the lexical choice of MT systems. From Japanese–English parallel news articles, we collected several hundred sentence pairs in which the source sentences containing repeated words were translated in a style that avoided repetition. Participants were required to encourage the MT system to output tokens in a non-repetitive manner while maintaining translation quality. We conducted human and automatic evaluations of systems submitted by two teams based on an encoder-decoder Transformer and a large language model, respectively. From the experimental results and analysis, we report a series of findings on this task.
Anthology ID:
2024.wmt-1.60
Volume:
Proceedings of the Ninth Conference on Machine Translation
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Barry Haddow, Tom Kocmi, Philipp Koehn, Christof Monz
Venue:
WMT
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
715–727
Language:
URL:
https://aclanthology.org/2024.wmt-1.60
DOI:
Bibkey:
Cite (ACL):
Kazutaka Kinugawa, Hideya Mino, Isao Goto, and Naoto Shirai. 2024. Findings of the WMT 2024 Shared Task on Non-Repetitive Translation. In Proceedings of the Ninth Conference on Machine Translation, pages 715–727, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Findings of the WMT 2024 Shared Task on Non-Repetitive Translation (Kinugawa et al., WMT 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.wmt-1.60.pdf