Title: Harder Task Needs More Experts: Dynamic Routing in MoE Models
Authors: Quzhe Huang, Zhenwei An, Nan Zhuang, Mingxu Tao, Chen Zhang, Yang Jin, Kun Xu, Liwei Chen, Songfang Huang, Yansong Feng
Date: 2024-08
Resource type: text
Genre: conference publication
Published in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Editors: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Publisher: Association for Computational Linguistics
Location: Bangkok, Thailand
Pages: 12883-12895
Citation key: huang-etal-2024-harder
DOI: 10.18653/v1/2024.acl-long.696
URL: https://aclanthology.org/2024.acl-long.696/