Empirical Analysis of Decoding Biases in Masked Diffusion Models

Pengcheng Huang; Tianming Liu; Zhenghao Liu (刘正皓); Yukun Yan (闫宇坤); Shuo Wang; Tong Xiao (肖桐); Zulong Chen; Maosong Sun (孙茂松)

Empirical Analysis of Decoding Biases in Masked Diffusion Models

Pengcheng Huang, Tianming Liu, Zhenghao Liu, Yukun Yan, Shuo Wang, Tong Xiao, Zulong Chen, Maosong Sun

Abstract

Masked Diffusion Models (MDMs) have recently emerged as a promising non-autoregressive paradigm for sequence generation. However, their performance is highly sensitive to the choice of decoding strategy. In this work, we reveal that prevalent uncertainty-based decoding strategies induce two decoding biases in MDMs: rigid boundary bias and trivial token bias. These biases limit the model’s reasoning ability and ultimately degrade generation quality. To address these challenges, we propose UNmasking Calibration for DecOding DEbiasing (UNCODE), a decoding calibration framework that regularizes uncertainty-based decoding by incorporating two complementary priors to shape global decoding trajectories and promote content informativeness. Extensive experiments on three advanced MDMs across seven reasoning- and planning-intensive benchmarks demonstrate that UNCODE consistently outperforms existing decoding strategies by more than 7%, while achieving performance comparable to autoregressive models of similar parameter scales. Our code will be made publicly available on GitHub.

Anthology ID:: 2026.acl-long.311
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 6853–6876
Language:
URL:: https://aclanthology.org/2026.acl-long.311/
DOI:
Bibkey:
Cite (ACL):: Pengcheng Huang, Tianming Liu, Zhenghao Liu, Yukun Yan, Shuo Wang, Tong Xiao, Zulong Chen, and Maosong Sun. 2026. Empirical Analysis of Decoding Biases in Masked Diffusion Models. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 6853–6876, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Empirical Analysis of Decoding Biases in Masked Diffusion Models (Huang et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.311.pdf
Checklist:: 2026.acl-long.311.checklist.pdf

PDF Cite Search Checklist Fix data