Bridging Relevance and Reasoning: Rationale Distillation in Retrieval-Augmented Generation

Pengyue Jia; Derong Xu; Xiaopeng Li; Zhaocheng Du; Xiangyang Li; Yichao Wang; Yuhao Wang; Qidong Liu; Maolin Wang; Huifeng Guo; Ruiming Tang; Xiangyu Zhao

doi:10.18653/v1/2025.findings-acl.220

Abstract

The reranker and the generator are two critical components of the Retrieval-Augmented Generation (RAG) pipeline, responsible respectively for ranking relevant documents and for generating responses. However, due to differences in pre-training data and objectives, there is an inevitable gap between the documents the reranker ranks as relevant and those the generator needs to support answering the query. To address this gap, we propose RADIO, a novel and practical preference alignment framework with RAtionale DIstillatiOn. Specifically, we first propose a rationale extraction method that leverages the reasoning capabilities of large language models (LLMs) to extract the rationales necessary for answering the query. Subsequently, a rationale-based alignment process reranks the documents based on the extracted rationales and fine-tunes the reranker to align the preferences. We conduct extensive experiments on two tasks across three datasets to demonstrate the effectiveness of our approach compared to baseline methods. Our code is released online to ease reproduction.
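The rationale-based alignment step described above can be illustrated with a toy sketch. This is not the authors' implementation: the abstract does not specify how documents are scored against a rationale, so the bag-of-words cosine similarity below is an assumed stand-in, and the names `cosine`, `rerank_by_rationale`, and `preference_pairs` are hypothetical. The sketch only shows the general shape of the idea: order candidate documents by how well they match an already-extracted rationale, then turn that order into preference pairs that a reranker could in principle be fine-tuned on.

```python
import math
from collections import Counter


def cosine(a: str, b: str) -> float:
    """Bag-of-words cosine similarity (assumed stand-in for the real scorer)."""
    ca, cb = Counter(a.lower().split()), Counter(b.lower().split())
    dot = sum(ca[t] * cb[t] for t in ca)
    na = math.sqrt(sum(v * v for v in ca.values()))
    nb = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (na * nb) if na and nb else 0.0


def rerank_by_rationale(rationale: str, documents: list[str]) -> list[int]:
    """Return document indices sorted by similarity to the rationale, best first."""
    scored = [(cosine(rationale, doc), i) for i, doc in enumerate(documents)]
    # Sort by descending score; ties keep the original document order.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [i for _, i in scored]


def preference_pairs(order: list[int]) -> list[tuple[int, int]]:
    """Expand a ranking into (preferred, less_preferred) index pairs."""
    return [
        (order[i], order[j])
        for i in range(len(order))
        for j in range(i + 1, len(order))
    ]


# Hypothetical inputs: the rationale would come from an LLM in the paper's setup.
rationale = "the capital of france is paris"
documents = [
    "paris is the capital of france",
    "berlin is in germany",
    "france borders spain",
]
order = rerank_by_rationale(rationale, documents)
pairs = preference_pairs(order)
```

In this toy setup the pairs play the role of a training signal; how RADIO actually scores documents and fine-tunes the reranker is described in the paper itself, not in this abstract.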

Anthology ID: 2025.findings-acl.220
Volume: Findings of the Association for Computational Linguistics: ACL 2025
Month: July
Year: 2025
Address: Vienna, Austria
Editors: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 4242–4256
URL: https://aclanthology.org/2025.findings-acl.220/
DOI: 10.18653/v1/2025.findings-acl.220
Cite (ACL): Pengyue Jia, Derong Xu, Xiaopeng Li, Zhaocheng Du, Xiangyang Li, Yichao Wang, Yuhao Wang, Qidong Liu, Maolin Wang, Huifeng Guo, Ruiming Tang, and Xiangyu Zhao. 2025. Bridging Relevance and Reasoning: Rationale Distillation in Retrieval-Augmented Generation. In Findings of the Association for Computational Linguistics: ACL 2025, pages 4242–4256, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal): Bridging Relevance and Reasoning: Rationale Distillation in Retrieval-Augmented Generation (Jia et al., Findings 2025)
PDF: https://aclanthology.org/2025.findings-acl.220.pdf
