Awakening Dormant Experts:Counterfactual Routing to Mitigate MoE Hallucinations

Wentao Hu; Yanbo Zhai; Xiaohui Hu; Mingkuan Zhao; Shanhong yu; Xue Liu; Kaidong Yu; Shuangyong Song (宋双永); Xuelong Li

Awakening Dormant Experts:Counterfactual Routing to Mitigate MoE Hallucinations

Wentao Hu, Yanbo Zhai, Xiaohui Hu, Mingkuan Zhao, Shanhong yu, Xue Liu, Kaidong Yu, Shuangyong Song, Xuelong Li

Abstract

Sparse Mixture-of-Experts (MoE) models have achieved remarkable scalability, yet they remain vulnerable to hallucinations, particularly when processing long-tail knowledge. We identify that this fragility stems from static Top-k routing: routers tend to favor high-frequency patterns over rare factual associations. Consequently, "specialist experts" possessing critical long-tail knowledge are often assigned low gating scores and remain "dormant"—under-prioritized for specific tokens despite their proven causal importance on other inputs. To address this, we propose Counterfactual Routing (CoR), a training-free inference framework designed to awaken these dormant experts. CoR integrates layer-wise perturbation analysis with the Counterfactual Expert Impact (CEI) metric to dynamically shift computational resources from syntax-dominant to knowledge-intensive layers while maintaining a constant total activation count, effectively retrieving causally decisive experts via virtual ablation. Extensive experiments on TruthfulQA, FACTOR, and TriviaQA demonstrate that CoR improves factual accuracy by 3.1% on average without increasing the inference budget, establishing a superior Pareto frontier compared to static scaling strategies.

Anthology ID:: 2026.acl-long.2187
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 47257–47271
Language:
URL:: https://aclanthology.org/2026.acl-long.2187/
DOI:
Bibkey:
Cite (ACL):: Wentao Hu, Yanbo Zhai, Xiaohui Hu, Mingkuan Zhao, Shanhong yu, Xue Liu, Kaidong Yu, Shuangyong Song, and Xuelong Li. 2026. Awakening Dormant Experts:Counterfactual Routing to Mitigate MoE Hallucinations. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 47257–47271, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Awakening Dormant Experts:Counterfactual Routing to Mitigate MoE Hallucinations (Hu et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.2187.pdf
Checklist:: 2026.acl-long.2187.checklist.pdf

PDF Cite Search Checklist Fix data