Optimizing Multi-Hop Document Retrieval Through Intermediate Representations

Jiaen Lin; Jingyu Liu; Yingbo Liu

doi:10.18653/v1/2025.findings-acl.816

Optimizing Multi-Hop Document Retrieval Through Intermediate Representations

Abstract

Retrieval-augmented generation (RAG) encounters challenges when addressing complex queries, particularly multi-hop questions. While several methods tackle multi-hop queries by iteratively generating internal queries and retrieving external documents, these approaches are computationally expensive. In this paper, we identify a three-stage information processing pattern in LLMs during layer-by-layer reasoning, consisting of extraction, processing, and subsequent extraction steps. This observation suggests that the representations in intermediate layers contain richer information compared to those in other layers. Building on this insight, we propose Layer-wise RAG (L-RAG). Unlike prior methods that focus on generating new internal queries, L-RAG leverages intermediate representations from the middle layers, which capture next-hop information, to retrieve external knowledge. L-RAG achieves performance comparable to multi-step approaches while maintaining inference overhead similar to that of standard RAG. Experimental results show that L-RAG outperforms existing RAG methods on open-domain multi-hop question-answering datasets, including MuSiQue, HotpotQA, and 2WikiMultiHopQA. The code is available in https://anonymous.4open.science/r/L-RAG-ADD5/.

Anthology ID:: 2025.findings-acl.816
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 15798–15809
Language:
URL:: https://aclanthology.org/2025.findings-acl.816/
DOI:: 10.18653/v1/2025.findings-acl.816
Bibkey:
Cite (ACL):: Jiaen Lin, Jingyu Liu, and Yingbo Liu. 2025. Optimizing Multi-Hop Document Retrieval Through Intermediate Representations. In Findings of the Association for Computational Linguistics: ACL 2025, pages 15798–15809, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: Optimizing Multi-Hop Document Retrieval Through Intermediate Representations (Lin et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-acl.816.pdf

PDF Cite Search Fix data