Jingbo Sun
Other people with similar names: Jingbo Sun
Unverified author pages with similar names: Jingbo Sun
2026
AutoSearch: Adaptive Search Depth for Efficient Agentic RAG via Reinforcement Learning
Jingbo Sun | Wenyue Chong | Songjun Tu | Qichao Zhang | Yaocheng Zhang | Jiajun Chai | Xiaohan Wang | Wei Lin | Guojun Yin | Dongbin Zhao
Findings of the Association for Computational Linguistics: ACL 2026
Jingbo Sun | Wenyue Chong | Songjun Tu | Qichao Zhang | Yaocheng Zhang | Jiajun Chai | Xiaohan Wang | Wei Lin | Guojun Yin | Dongbin Zhao
Findings of the Association for Computational Linguistics: ACL 2026
Agentic retrieval-augmented generation (RAG) systems enable large language models (LLMs) to solve complex tasks through multi-step interaction with external retrieval tools. However, such multi-step interaction often involves redundant search steps, incurring substantial computational cost and latency. Prior work limits search depth (i.e., the number of search steps) to reduce cost, but this often leads to underexploration of complex questions. To address this, we first investigate how search depth affects accuracy and find a minimal sufficient search depth that defines an accuracy-efficiency trade-off, jointly determined by question complexity and the agent’s capability. Furthermore, we propose AutoSearch, a reinforcement learning framework that evaluates each search step via self-generated intermediate answers. By a self-answering mechanism, AutoSearch identifies the minimal sufficient search depth and promotes efficient search by rewarding its attainment while penalizing over-searching. In addition, reward mechanisms are introduced to stabilize search behavior and improve answer quality on complex questions. Extensive experiments on multiple benchmarks show that AutoSearch achieves a superior accuracy-efficiency trade-off, alleviating over-searching while preserving search quality.
Beyond Query Memorization: Large Language Model Routing with Query Decomposition and Historical Matching
Bo Lv | Jingbo Sun | Jianwei Lv | Chen Tang | Shaojie Zhang | Nayu Liu | Guoxin Yu | Zihao Li | Qichao Zhang | Dongbin Zhao | Ping Luo | Yue Yu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Bo Lv | Jingbo Sun | Jianwei Lv | Chen Tang | Shaojie Zhang | Nayu Liu | Guoxin Yu | Zihao Li | Qichao Zhang | Dongbin Zhao | Ping Luo | Yue Yu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Optimizing the trade-off among predictive performance and computational cost is a central focus in the deployment of Large Language Models (LLMs). Current routing methods primarily rely on direct mapping from queries to models based on surface-level features, making them susceptible to the memorization trap and leading to poor generalizability on out-of-distribution (OOD) data. In this paper, we propose DecoR, a novel routing framework that recasts the routing task as a matching process of sifting similar queries from historical logs, effectively mitigating the memorization trap. To enhance matching accuracy, we introduce a query capability deconstruction method that decouples linguistic surface forms from task-intrinsic requirements, directing matching toward capability dimensions to ground decisions in essential task attributes. Furthermore, we develop CodaSet, a comprehensive benchmark for assessing routing generalization, where experimental results demonstrate that DecoR maintains superior accuracy while substantially lowering inference costs across both in-distribution and OOD settings.