Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective

Chenze Shao, Fandong Meng, Jiali Zeng, Jie Zhou


Abstract
Neural Machine Translation (NMT) has made remarkable progress over the past years. However, under-translation and over-translation remain two challenging problems in state-of-the-art NMT systems. In this work, we conduct an in-depth analysis on the underlying cause of under-translation in NMT, providing an explanation from the perspective of decoding objective. To optimize the beam search objective, the model tends to overlook words it is less confident about, leading to the under-translation phenomenon. Correspondingly, the model’s confidence in predicting the End Of Sentence (EOS) diminishes when under-translation occurs, serving as a mild penalty for under-translated candidates. Building upon this analysis, we propose employing the confidence of predicting EOS as a detector for under-translation, and strengthening the confidence-based penalty to penalize candidates with a high risk of under-translation.Experiments on both synthetic and real-world data show that our method can accurately detect and rectify under-translated outputs, with minor impact on other correct translations.
Anthology ID:
2024.luhme-long.209
Volume:
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Lun-Wei Ku, Andre Martins, Vivek Srikumar
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
3800–3814
Language:
URL:
https://aclanthology.org/2024.luhme-long.209/
DOI:
10.18653/v1/2024.acl-long.209
Bibkey:
Cite (ACL):
Chenze Shao, Fandong Meng, Jiali Zeng, and Jie Zhou. 2024. Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3800–3814, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective (Shao et al., ACL 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.acl-long.209.pdf