Who Says Elephants Can’t Run: Bringing Large Scale MoE Models into Cloud Scale Production Young Jin Kim author Rawn Henry author Raffy Fahim author Hany Hassan author 2022-12 text Proceedings of the Third Workshop on Simple and Efficient Natural Language Processing (SustaiNLP) Angela Fan editor Iryna Gurevych editor Yufang Hou editor Zornitsa Kozareva editor Sasha Luccioni editor Nafise Sadat Moosavi editor Sujith Ravi editor Gyuwan Kim editor Roy Schwartz editor Andreas Rücklé editor Association for Computational Linguistics Abu Dhabi, United Arab Emirates (Hybrid) conference publication kim-etal-2022-says 10.18653/v1/2022.sustainlp-1.6 https://aclanthology.org/2022.sustainlp-1.6/ 2022-12 36 43