GENRA: Enhancing Zero-shot Retrieval with Rank Aggregation

Georgios Katsimpras, Georgios Paliouras


Abstract
Large Language Models (LLMs) have been shown to effectively perform zero-shot document retrieval, a process that typically consists of two steps: i) retrieving relevant documents, and ii) re-ranking them based on their relevance to the query. This paper presents GENRA, a new approach to zero-shot document retrieval that incorporates rank aggregation to improve retrieval effectiveness. Given a query, GENRA first utilizes LLMs to generate informative passages that capture the query’s intent. These passages are then employed to guide the retrieval process, selecting similar documents from the corpus. Next, we use LLMs again for a second refinement step. This step can be configured for either direct relevance assessment of each retrieved document or for re-ranking the retrieved documents. Ultimately, both approaches ensure that only the most relevant documents are kept. Upon this filtered set of documents, we perform multi-document retrieval, generating individual rankings for each document. As a final step, GENRA leverages rank aggregation, combining the individual rankings to produce a single refined ranking. Extensive experiments on benchmark datasets demonstrate that GENRA improves existing approaches, highlighting the effectiveness of the proposed methodology in zero-shot retrieval.
Anthology ID:
2024.emnlp-main.431
Volume:
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
7566–7577
Language:
URL:
https://aclanthology.org/2024.emnlp-main.431/
DOI:
10.18653/v1/2024.emnlp-main.431
Bibkey:
Cite (ACL):
Georgios Katsimpras and Georgios Paliouras. 2024. GENRA: Enhancing Zero-shot Retrieval with Rank Aggregation. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 7566–7577, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
GENRA: Enhancing Zero-shot Retrieval with Rank Aggregation (Katsimpras & Paliouras, EMNLP 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.emnlp-main.431.pdf