Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation

Boxuan Lyu; Hidetaka Kamigaito; Kotaro Funakoshi; Manabu Okumura

doi:10.18653/v1/2025.acl-long.149

Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation

Boxuan Lyu, Hidetaka Kamigaito, Kotaro Funakoshi, Manabu Okumura

Abstract

Maximum a posteriori decoding, a commonly used method for neural machine translation (NMT), aims to maximize the estimated posterior probability. However, high estimated probability does not always lead to high translation quality. Minimum Bayes Risk (MBR) decoding offers an alternative by seeking hypotheses with the highest expected utility.Inspired by Quality Estimation (QE) reranking which uses the QE model as a ranker, we propose source-based MBR (sMBR) decoding, a novel approach that utilizes quasi-sources (generated via paraphrasing or back-translation) as “support hypotheses” and a reference-free quality estimation metric as the utility function, marking the first work to solely use sources in MBR decoding. Experiments show that sMBR outperforms QE reranking and the standard MBR decoding. Our findings suggest that sMBR is a promising approach for NMT decoding.

Anthology ID:: 2025.acl-long.149
Volume:: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2976–2994
Language:
URL:: https://aclanthology.org/2025.acl-long.149/
DOI:: 10.18653/v1/2025.acl-long.149
Bibkey:
Cite (ACL):: Boxuan Lyu, Hidetaka Kamigaito, Kotaro Funakoshi, and Manabu Okumura. 2025. Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 2976–2994, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: Unveiling the Power of Source: Source-based Minimum Bayes Risk Decoding for Neural Machine Translation (Lyu et al., ACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.acl-long.149.pdf

PDF Cite Search Fix data