Retrieval Heads are Dynamic

Yuping Lin; Zitao Li; Yue Xing; Pengfei He; Yingqian Cui; Yaliang Li; Bolin Ding; Jingren Zhou; Jiliang Tang

Abstract

Recent studies have identified "retrieval heads" in Large Language Models (LLMs) that are responsible for extracting information from input contexts. However, prior work largely relies on static statistics aggregated across datasets, identifying heads that perform retrieval on average. This perspective overlooks the fine-grained temporal dynamics of autoregressive generation. In this paper, we investigate retrieval heads from a dynamic perspective. Through extensive analysis, we establish three core claims: (1) Dynamism: Retrieval heads vary dynamically across timesteps; (2) Irreplaceability: Dynamic retrieval heads are specific to each timestep and cannot be effectively replaced by static retrieval heads; and (3) Correlation: The model’s hidden state encodes a predictive signal for future retrieval head patterns, indicating an internal planning mechanism. We validate these findings on the Needle-in-a-Haystack task and a multi-hop QA task, and quantify the difference in utility between dynamic and static retrieval heads in a Dynamic Retrieval-Augmented Generation framework. Our study provides new insights into the internal mechanisms of LLMs.

Anthology ID: 2026.acl-long.715
Volume: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 15710–15729
URL: https://aclanthology.org/2026.acl-long.715/
Cite (ACL): Yuping Lin, Zitao Li, Yue Xing, Pengfei He, Yingqian Cui, Yaliang Li, Bolin Ding, Jingren Zhou, and Jiliang Tang. 2026. Retrieval Heads are Dynamic. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 15710–15729, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): Retrieval Heads are Dynamic (Lin et al., ACL 2026)
PDF: https://aclanthology.org/2026.acl-long.715.pdf
Checklist: 2026.acl-long.715.checklist.pdf
