Boost Transformer-based Language Models with GPU-Friendly Sparsity and Quantization Chong Yu author Tao Chen author Zhongxue Gan author 2023-07 text Findings of the Association for Computational Linguistics: ACL 2023 Anna Rogers editor Jordan Boyd-Graber editor Naoaki Okazaki editor Association for Computational Linguistics Toronto, Canada conference publication yu-etal-2023-boost 10.18653/v1/2023.findings-acl.15 https://aclanthology.org/2023.findings-acl.15/ 2023-07 218 235