LLM-QAT: Data-Free Quantization Aware Training for Large Language Models Zechun Liu author Barlas Oguz author Changsheng Zhao author Ernie Chang author Pierre Stock author Yashar Mehdad author Yangyang Shi author Raghuraman Krishnamoorthi author Vikas Chandra author 2024-08 text Findings of the Association for Computational Linguistics: ACL 2024 Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication liu-etal-2024-llm 10.18653/v1/2024.findings-acl.26 https://aclanthology.org/2024.findings-acl.26/ 2024-08 467 484