COST-EFF: Collaborative Optimization of Spatial and Temporal Efficiency with Slenderized Multi-exit Language Models
Authors: Bowen Shen, Zheng Lin, Yuanxin Liu, Zhengxiao Liu, Lei Wang, Weiping Wang
In: Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
Editors: Yoav Goldberg, Zornitsa Kozareva, Yue Zhang
Publisher: Association for Computational Linguistics
Place: Abu Dhabi, United Arab Emirates
Date: 2022-12
Type: text (conference publication)
Pages: 1719-1730
ID: shen-etal-2022-cost
DOI: 10.18653/v1/2022.emnlp-main.112
URL: https://aclanthology.org/2022.emnlp-main.112/