ReRec: Reasoning-Augmented LLM-based Recommendation Assistant via Reinforcement Fine-tuning

Jiani Huang; Shijie Wang; Liang-Bo Ning; Wenqi Fan; Qing Li

ReRec: Reasoning-Augmented LLM-based Recommendation Assistant via Reinforcement Fine-tuning

Jiani Huang, Shijie Wang, Liang-Bo Ning, Wenqi Fan, Li Qing

Abstract

With the rise of LLMs, there is an increasing need for intelligent recommendation assistants that can handle complex queries and provide personalized, reasoning-driven recommendations. LLM-based recommenders show potential but face challenges in multi-step reasoning, underscoring the need for reasoning-augmented systems. To address this gap, we propose ReRec, a novel reinforcement fine-tuning (RFT) framework designed to improve LLM reasoning in complex recommendation tasks. Our framework introduces three key components: (1) Dual-Graph Enhanced Reward Shaping, integrating recommendation metrics like NDCG@K with Query Alignment and Preference Alignment Scores to provide fine-grained reward signals for LLM optimization; (2) Reasoning-aware Advantage Estimation, which decomposes LLM outputs into reasoning segments and penalizes incorrect steps to enhance reasoning of recommendation; and (3) Online Curriculum Scheduler, dynamically assess query difficulty and organize training curriculum to ensure stable learning during RFT. Experiments demonstrate that ReRec outperforms state-of-the-art baselines and preserves core abilities like instruction-following and general knowledge. Our codes are available at https://anonymous.4open.science/r/ReRec/.

Anthology ID:: 2026.acl-long.964
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 21040–21055
Language:
URL:: https://aclanthology.org/2026.acl-long.964/
DOI:
Bibkey:
Cite (ACL):: Jiani Huang, Shijie Wang, Liang-Bo Ning, Wenqi Fan, and Li Qing. 2026. ReRec: Reasoning-Augmented LLM-based Recommendation Assistant via Reinforcement Fine-tuning. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 21040–21055, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: ReRec: Reasoning-Augmented LLM-based Recommendation Assistant via Reinforcement Fine-tuning (Huang et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.964.pdf
Checklist:: 2026.acl-long.964.checklist.pdf

PDF Cite Search Checklist Fix data