Retaining Key Information under High Compression Ratios: Query-Guided Compressor for LLMs
Authors: Zhiwei Cao, Qian Cao, Yu Lu, Ningxin Peng, Luyang Huang, Shanbo Cheng, Jinsong Su
In: Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Editors: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Publisher: Association for Computational Linguistics
Location: Bangkok, Thailand
Date: 2024-08
Type: conference publication
Pages: 12685-12695
Identifier: cao-etal-2024-retaining
DOI: 10.18653/v1/2024.acl-long.685
URL: https://aclanthology.org/2024.acl-long.685/