LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit Ruihao Gong author Yang Yong author Shiqiao Gu author Yushi Huang author Chengtao Lv author Yunchen Zhang author Dacheng Tao author Xianglong Liu author 2024-11 text Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Industry Track Franck Dernoncourt editor Daniel Preoţiuc-Pietro editor Anastasia Shimorina editor Association for Computational Linguistics Miami, Florida, US conference publication gong-etal-2024-llmc 10.18653/v1/2024.emnlp-industry.12 https://aclanthology.org/2024.emnlp-industry.12/ 2024-11 132 152