A Comprehensive Evaluation of Quantization Strategies for Large Language Models Renren Jin author Jiangcun Du author Wuwei Huang author Wei Liu author Jian Luan author Bin Wang author Deyi Xiong author 2024-08 text Findings of the Association for Computational Linguistics: ACL 2024 Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication jin-etal-2024-comprehensive 10.18653/v1/2024.findings-acl.726 https://aclanthology.org/2024.findings-acl.726/ 2024-08 12186 12215