Title: Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models
Authors: Zhengxin Zhang, Dan Zhao, Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Qing Li, Yong Jiang, Zhihao Jia
Date: 2024-08
Resource type: text
Published in: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Editors: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Publisher: Association for Computational Linguistics
Place: Bangkok, Thailand
Genre: conference publication
Citation key: zhang-etal-2024-quantized
DOI: 10.18653/v1/2024.acl-long.1
URL: https://aclanthology.org/2024.acl-long.1/
Pages: 1-17