On the True Distribution Approximation of Minimum Bayes-Risk Decoding

Atsumoto Ohashi, Ukyo Honda, Tetsuro Morimura, Yuu Jinnai


Abstract
Minimum Bayes-risk (MBR) decoding has recently gained renewed attention in text generation.MBR decoding considers texts sampled from a model as pseudo-references and selects the text with the highest similarity to the others.Therefore, sampling is one of the key elements of MBR decoding, and previous studies reported that the performance varies by sampling methods.From a theoretical standpoint, this performance variation is likely tied to how closely the samples approximate the true distribution of references.However, this approximation has not been the subject of in-depth study.In this study, we propose using anomaly detection to measure the degree of approximation.We first closely examine the performance variation and then show that previous hypotheses about samples do not correlate well with the variation, but our introduced anomaly scores do.The results are the first to empirically support the link between the performance and the core assumption of MBR decoding.
Anthology ID:
2024.naacl-short.38
Volume:
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers)
Month:
June
Year:
2024
Address:
Mexico City, Mexico
Editors:
Kevin Duh, Helena Gomez, Steven Bethard
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
459–468
Language:
URL:
https://aclanthology.org/2024.naacl-short.38
DOI:
Bibkey:
Cite (ACL):
Atsumoto Ohashi, Ukyo Honda, Tetsuro Morimura, and Yuu Jinnai. 2024. On the True Distribution Approximation of Minimum Bayes-Risk Decoding. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 2: Short Papers), pages 459–468, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):
On the True Distribution Approximation of Minimum Bayes-Risk Decoding (Ohashi et al., NAACL 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.naacl-short.38.pdf