R³A: Reinforced Reasoning for Relevance Assessment for RAG in User-Generated Content Platforms

Xiaowei Yuan; Lei Jin; Haoxin Zhang; Ziyang Huang; Yan Gao; Yiwu; Yao Hu; Jun Zhao; Kang Liu

R³A: Reinforced Reasoning for Relevance Assessment for RAG in User-Generated Content Platforms

Xiaowei Yuan, Lei Jin, Haoxin Zhang, Ziyang Huang, Yan Gao, Yiwu, Yao Hu, Jun Zhao, Kang Liu

Abstract

Retrieval-augmented generation (RAG) plays a critical role in user-generated content (UGC) platforms, but its effectiveness critically depends on accurate query–document relevance assessment. Despite recent advances in applying large language models (LLMs) to relevance modeling, UGC platforms present unique challenges: 1) ambiguous user intent due to sparse user feedback in RAG scenarios, and 2) asymmetric relevance, where relevance is driven by localized answer-bearing content rather than global query–document similarity. To address these issues, we propose the Reinforced Reasoning model for Relevance Assessment (R³A), which decomposes relevance assessment into intent inference and evidence grounding. R³A leverages auxiliary high-clicked documents to infer latent query intent, and extracts verbatim evidence fragments to ground relevance decisions, reducing noise sensitivity and improving asymmetric relevance modeling. Experimental results demonstrate that R³A substantially outperforms strong baselines on offline benchmarks, while the distilled R³A-1.5B model achieves significant gains in large-scale online A/B testing, effectively balancing performance and practical deployability.

Anthology ID:: 2026.acl-industry.10
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Month:: July
Year:: 2026
Address:: San Diego, California, USA
Editors:: Yunyao Li, Georg Rehm, Mei Tu
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 127–140
Language:
URL:: https://aclanthology.org/2026.acl-industry.10/
DOI:
Bibkey:
Cite (ACL):: Xiaowei Yuan, Lei Jin, Haoxin Zhang, Ziyang Huang, Yan Gao, Yiwu, Yao Hu, Jun Zhao, and Kang Liu. 2026. R³A: Reinforced Reasoning for Relevance Assessment for RAG in User-Generated Content Platforms. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), pages 127–140, San Diego, California, USA. Association for Computational Linguistics.
Cite (Informal):: R³A: Reinforced Reasoning for Relevance Assessment for RAG in User-Generated Content Platforms (Yuan et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-industry.10.pdf

PDF Cite Search Fix data