Uncertainty-Aware Routing for Principled Alignment with MoE Dynamics

YiLong Chen; Junyuan Shang; Yuchen Feng; Zhenyu Zhang; Naibin Gu; Ziqi Wang; Tingwen Liu; Shuohuan Wang; Yu Sun; Hua Wu (吴华); Haifeng Wang

Uncertainty-Aware Routing for Principled Alignment with MoE Dynamics

Yilong Chen, Junyuan Shang, Yuchen Feng, Zhenyu Zhang, Naibin Gu, Ziqi Wang, Tingwen Liu, Shuohuan Wang, Yu Sun, Hua Wu, Haifeng Wang

Abstract

Mixture-of-Experts (MoE) is a cornerstone for scaling LLMs, yet its training dynamics remain poorly understood, often leading to sub-optimal specialization. Moving beyond static routing, we present a systematic study of the MoE lifecycle using Helmholtz Free Energyand Router Entropy. We identify a universal Three-Stage Phase Transition—Exploration, Symmetry Breaking, and Stabilization—marked by an Energy Climb and Plateau. This reflects Frustrated Exploration, caused by structural interference between specialization drives and uniformity constraints. To address this, we propose Uncertainty-Aware Routing (UAR), which aligns routing with the model’s epistemic state via: (1) Evidence-Triggered Expansion, increasing active experts for high-energy tokens, and (2) Epistemic Masking, applying load-balancing only in high-uncertainty regimes to shield mature experts. Experiments confirm UAR reduces perplexity and improves expert distinctiveness, offering a principled path toward thermodynamically aligned computation.

Anthology ID:: 2026.acl-long.1801
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 38865–38880
Language:
URL:: https://aclanthology.org/2026.acl-long.1801/
DOI:
Bibkey:
Cite (ACL):: Yilong Chen, Junyuan Shang, Yuchen Feng, Zhenyu Zhang, Naibin Gu, Ziqi Wang, Tingwen Liu, Shuohuan Wang, Yu Sun, Hua Wu, and Haifeng Wang. 2026. Uncertainty-Aware Routing for Principled Alignment with MoE Dynamics. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 38865–38880, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Uncertainty-Aware Routing for Principled Alignment with MoE Dynamics (Chen et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.1801.pdf
Checklist:: 2026.acl-long.1801.checklist.pdf

PDF Cite Search Checklist Fix data