@inproceedings{wang-etal-2025-quito,
title = "{QUITO}-{X}: A New Perspective on Context Compression from the Information Bottleneck Theory",
author = "Wang, Yihang and
Huang, Xu and
Tian, Bowen and
Su, Yueyang and
Yu, Lei and
Liao, Huaming and
Fan, Yixing and
Guo, Jiafeng and
Cheng, Xueqi",
editor = "Christodoulopoulos, Christos and
Chakraborty, Tanmoy and
Rose, Carolyn and
Peng, Violet",
booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2025",
month = nov,
year = "2025",
address = "Suzhou, China",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.findings-emnlp.362/",
pages = "6841--6856",
ISBN = "979-8-89176-335-7",
abstract = "Generative large language models ( LLMs) have achieved remarkable success in various industrial applications, owing to their promising In-Context Learning capabilities. However, the issue of long context in complex tasks poses a significant barrier to their wider adoption, manifested in two main aspects: (i) The excessively long context leads to high costs and inference delays. (ii) A substantial amount of task-irrelevant information introduced by long contexts exacerbates the ``lost in the middle'' problem. Existing methods compress context by removing redundant tokens using metrics such as self-information or perplexity ( PPL ), which is inconsistent with the objective of retaining the most important tokens when conditioning on a given query. In this study, we introduce information bottleneck theory (IB) to model the problem, offering a novel perspective that thoroughly addresses the essential properties required for context compression. Additionally, we propose a cross-attention-based approach to approximate mutual information in IB, which can be flexibly replaced with suitable alternatives in different scenarios. Extensive experiments on four datasets demonstrate that our method achieves a 25{\%} increase in compression rate compared to the state-of-the-art, while maintaining question answering performance. In particular, the context compressed by our method even outperform the full context in some cases."
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="wang-etal-2025-quito">
<titleInfo>
<title>QUITO-X: A New Perspective on Context Compression from the Information Bottleneck Theory</title>
</titleInfo>
<name type="personal">
<namePart type="given">Yihang</namePart>
<namePart type="family">Wang</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Xu</namePart>
<namePart type="family">Huang</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Bowen</namePart>
<namePart type="family">Tian</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Yueyang</namePart>
<namePart type="family">Su</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Lei</namePart>
<namePart type="family">Yu</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Huaming</namePart>
<namePart type="family">Liao</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Yixing</namePart>
<namePart type="family">Fan</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jiafeng</namePart>
<namePart type="family">Guo</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Xueqi</namePart>
<namePart type="family">Cheng</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2025-11</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Findings of the Association for Computational Linguistics: EMNLP 2025</title>
</titleInfo>
<name type="personal">
<namePart type="given">Christos</namePart>
<namePart type="family">Christodoulopoulos</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Tanmoy</namePart>
<namePart type="family">Chakraborty</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Carolyn</namePart>
<namePart type="family">Rose</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Violet</namePart>
<namePart type="family">Peng</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Suzhou, China</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
<identifier type="isbn">979-8-89176-335-7</identifier>
</relatedItem>
<abstract>Generative large language models (LLMs) have achieved remarkable success in various industrial applications, owing to their promising In-Context Learning capabilities. However, the issue of long context in complex tasks poses a significant barrier to their wider adoption, manifested in two main aspects: (i) The excessively long context leads to high costs and inference delays. (ii) A substantial amount of task-irrelevant information introduced by long contexts exacerbates the “lost in the middle” problem. Existing methods compress context by removing redundant tokens using metrics such as self-information or perplexity (PPL), which is inconsistent with the objective of retaining the most important tokens when conditioning on a given query. In this study, we introduce information bottleneck theory (IB) to model the problem, offering a novel perspective that thoroughly addresses the essential properties required for context compression. Additionally, we propose a cross-attention-based approach to approximate mutual information in IB, which can be flexibly replaced with suitable alternatives in different scenarios. Extensive experiments on four datasets demonstrate that our method achieves a 25% increase in compression rate compared to the state-of-the-art, while maintaining question answering performance. In particular, the context compressed by our method even outperforms the full context in some cases.</abstract>
<identifier type="citekey">wang-etal-2025-quito</identifier>
<location>
<url>https://aclanthology.org/2025.findings-emnlp.362/</url>
</location>
<part>
<date>2025-11</date>
<extent unit="page">
<start>6841</start>
<end>6856</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T QUITO-X: A New Perspective on Context Compression from the Information Bottleneck Theory
%A Wang, Yihang
%A Huang, Xu
%A Tian, Bowen
%A Su, Yueyang
%A Yu, Lei
%A Liao, Huaming
%A Fan, Yixing
%A Guo, Jiafeng
%A Cheng, Xueqi
%Y Christodoulopoulos, Christos
%Y Chakraborty, Tanmoy
%Y Rose, Carolyn
%Y Peng, Violet
%S Findings of the Association for Computational Linguistics: EMNLP 2025
%D 2025
%8 November
%I Association for Computational Linguistics
%C Suzhou, China
%@ 979-8-89176-335-7
%F wang-etal-2025-quito
%X Generative large language models (LLMs) have achieved remarkable success in various industrial applications, owing to their promising In-Context Learning capabilities. However, the issue of long context in complex tasks poses a significant barrier to their wider adoption, manifested in two main aspects: (i) The excessively long context leads to high costs and inference delays. (ii) A substantial amount of task-irrelevant information introduced by long contexts exacerbates the “lost in the middle” problem. Existing methods compress context by removing redundant tokens using metrics such as self-information or perplexity (PPL), which is inconsistent with the objective of retaining the most important tokens when conditioning on a given query. In this study, we introduce information bottleneck theory (IB) to model the problem, offering a novel perspective that thoroughly addresses the essential properties required for context compression. Additionally, we propose a cross-attention-based approach to approximate mutual information in IB, which can be flexibly replaced with suitable alternatives in different scenarios. Extensive experiments on four datasets demonstrate that our method achieves a 25% increase in compression rate compared to the state-of-the-art, while maintaining question answering performance. In particular, the context compressed by our method even outperforms the full context in some cases.
%U https://aclanthology.org/2025.findings-emnlp.362/
%P 6841-6856
Markdown (Informal)
[QUITO-X: A New Perspective on Context Compression from the Information Bottleneck Theory](https://aclanthology.org/2025.findings-emnlp.362/) (Wang et al., Findings 2025)
ACL
- Yihang Wang, Xu Huang, Bowen Tian, Yueyang Su, Lei Yu, Huaming Liao, Yixing Fan, Jiafeng Guo, and Xueqi Cheng. 2025. QUITO-X: A New Perspective on Context Compression from the Information Bottleneck Theory. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 6841–6856, Suzhou, China. Association for Computational Linguistics.
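
Below is a minimal, hypothetical sketch of the query-conditioned compression idea the abstract describes: score each context token by the cross-attention an encoder-decoder model pays it when the query is fed to the decoder, then keep only the top-scoring tokens. The model name (`google/flan-t5-base`), the layer/head/position averaging, and the `keep_ratio` are illustrative assumptions, not the authors' QUITO-X configuration; the abstract notes the attention-based mutual-information approximation can be swapped for other estimators.

```python
# Sketch: query-conditioned context compression via cross-attention scores.
# Assumptions (not from the paper): Flan-T5 as the scorer, mean pooling over
# layers/heads/query positions, and a fixed keep ratio.
import torch
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

MODEL = "google/flan-t5-base"  # any seq2seq model exposing cross-attentions
tokenizer = AutoTokenizer.from_pretrained(MODEL)
model = AutoModelForSeq2SeqLM.from_pretrained(MODEL).eval()

def compress(context: str, query: str, keep_ratio: float = 0.5) -> str:
    """Keep the context tokens the decoder attends to most, given the query."""
    enc = tokenizer(context, return_tensors="pt", truncation=True)
    dec = tokenizer(query, return_tensors="pt", truncation=True)
    with torch.no_grad():
        out = model(
            input_ids=enc.input_ids,
            attention_mask=enc.attention_mask,
            decoder_input_ids=dec.input_ids,  # query on the decoder side
            output_attentions=True,
        )
    # out.cross_attentions: one (batch, heads, query_len, context_len) tensor
    # per layer. Average over layers (0), heads (2), and query positions (3)
    # to obtain a single importance score per context token.
    attn = torch.stack(out.cross_attentions).mean(dim=(0, 2, 3)).squeeze(0)
    k = max(1, int(keep_ratio * attn.numel()))
    keep = torch.topk(attn, k).indices.sort().values  # restore original order
    return tokenizer.decode(enc.input_ids[0, keep], skip_special_tokens=True)

print(compress(
    "Paris is the capital of France. Berlin is the capital of Germany.",
    "What is the capital of France?",
    keep_ratio=0.4,
))
```

Sorting the kept indices preserves the surface order of the surviving tokens, so the compressed context stays readable for the downstream LLM; the pooling scheme is one simple choice among many the paper's framework would admit.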