Shifting from Ranking to Set Selection for Retrieval Augmented Generation

Dahyun Lee; Yongrae Jo; Haeju Park; Moontae Lee

doi:10.18653/v1/2025.acl-long.861

Shifting from Ranking to Set Selection for Retrieval Augmented Generation

Dahyun Lee, Yongrae Jo, Haeju Park, Moontae Lee

Abstract

Retrieval in Retrieval-Augmented Generation (RAG) must ensure that retrieved passages are not only individually relevant but also collectively form a comprehensive set.Existing approaches primarily rerank top-k passages based on their individual relevance, often failing to meet the information needs of complex queries in multi-hop question answering.In this work, we propose a set-wise passage selection approach and introduce SetR, which explicitly identifies the information requirements of a query through Chain-of-Thought reasoning and selects an optimal set of passages that collectively satisfy those requirements.Experiments on multi-hop RAG benchmarks show that SetR outperforms both proprietary LLM-based rerankers and open-source baselines in terms of answer correctness and retrieval quality, providing an effective and efficient alternative to traditional rerankers in RAG systems.The code is available at https://github.com/LGAI-Research/SetR

Anthology ID:: 2025.acl-long.861
Volume:: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 17606–17619
Language:
URL:: https://aclanthology.org/2025.acl-long.861/
DOI:: 10.18653/v1/2025.acl-long.861
Bibkey:
Cite (ACL):: Dahyun Lee, Yongrae Jo, Haeju Park, and Moontae Lee. 2025. Shifting from Ranking to Set Selection for Retrieval Augmented Generation. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 17606–17619, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: Shifting from Ranking to Set Selection for Retrieval Augmented Generation (Lee et al., ACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.acl-long.861.pdf

PDF Cite Search Fix data