MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation Simiao Zuo author Qingru Zhang author Chen Liang author Pengcheng He author Tuo Zhao author Weizhu Chen author 2022-07 text Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies Marine Carpuat editor Marie-Catherine de Marneffe editor Ivan Vladimir Meza Ruiz editor Association for Computational Linguistics Seattle, United States conference publication zuo-etal-2022-moebert 10.18653/v1/2022.naacl-main.116 https://aclanthology.org/2022.naacl-main.116/ 2022-07 1610 1623