Title: A Survey on Model Compression for Large Language Models
Authors: Xunyu Zhu, Jian Li, Yong Liu, Can Ma, Weiping Wang
Year: 2024
Type: journal article
Journal: Transactions of the Association for Computational Linguistics
Publisher: MIT Press, Cambridge, MA
Volume: 12
Pages: 1556-1577
DOI: 10.1162/tacl_a_00704
URL: https://aclanthology.org/2024.tacl-1.85/
Citation key: zhu-etal-2024-survey-model