HyperRouter: Towards Efficient Training and Inference of Sparse Mixture of Experts Truong Giang Do author Le Khiem author Quang Pham author TrungTin Nguyen author Thanh-Nam Doan author Binh Nguyen author Chenghao Liu author Savitha Ramasamy author Xiaoli Li author Steven Hoi author 2023-12 text Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing Houda Bouamor editor Juan Pino editor Kalika Bali editor Association for Computational Linguistics Singapore conference publication do-etal-2023-hyperrouter 10.18653/v1/2023.emnlp-main.351 https://aclanthology.org/2023.emnlp-main.351/ 2023-12 5754 5765