On the Benefits of Learning to Route in Mixture-of-Experts Models Nishanth Dikkala author Nikhil Ghosh author Raghu Meka author Rina Panigrahy author Nikhil Vyas author Xin Wang author 2023-12 text Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing Houda Bouamor editor Juan Pino editor Kalika Bali editor Association for Computational Linguistics Singapore conference publication dikkala-etal-2023-benefits 10.18653/v1/2023.emnlp-main.583 https://aclanthology.org/2023.emnlp-main.583/ 2023-12 9376 9396