Title: Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference
Authors: Wangchunshu Zhou; Ronan Le Bras; Yejin Choi
Date: 2023-07
Resource type: text
Published in: Findings of the Association for Computational Linguistics: ACL 2023
Editors: Anna Rogers; Jordan Boyd-Graber; Naoaki Okazaki
Publisher: Association for Computational Linguistics
Place: Toronto, Canada
Genre: conference publication
Identifier: zhou-etal-2023-modular
DOI: 10.18653/v1/2023.findings-acl.664
URL: https://aclanthology.org/2023.findings-acl.664/
Pages: 10452-10465