Scaling Vision-Language Models with Sparse Mixture of Experts Sheng Shen author Zhewei Yao author Chunyuan Li author Trevor Darrell author Kurt Keutzer author Yuxiong He author 2023-12 text Findings of the Association for Computational Linguistics: EMNLP 2023 Houda Bouamor editor Juan Pino editor Kalika Bali editor Association for Computational Linguistics Singapore conference publication shen-etal-2023-scaling 10.18653/v1/2023.findings-emnlp.758 https://aclanthology.org/2023.findings-emnlp.758/ 2023-12 11329 11344