SGA-MCTS: Decoupling Planning from Execution via Training-Free Atomic Experience Retrieval

Xin Xie; Dongyun Xue; Wuguannan Yao; Mingxiao Feng; Wengang Zhou; Xiang Qi; Houqiang Li; Peng Zhang

SGA-MCTS: Decoupling Planning from Execution via Training-Free Atomic Experience Retrieval

Xin Xie, Dongyun Xue, Wuguannan Yao, Mingxiao Feng, Wengang Zhou, Xiang Qi, Houqiang Li, Peng Zhang

Abstract

LLM-powered systems require complex multi-step decision-making abilities to solve real-world tasks, yet current planning approaches face a trade-off between the high latency of inference-time search and the limited generalization of supervised fine-tuning. To address this limitation, we introduce SGA-MCTS, a framework that casts LLM planning as non-parametric retrieval. Offline, we leverage Monte Carlo Tree Search (MCTS) to explore the solution space and distill high-fidelity trajectories into State-Goal-Action (SGA) atoms. These atoms are de-lexicalized primitives that abstract concrete entities into symbolic slots, preserving reusable causal logic while discarding domain-specific noise. Online, a retrieval-augmented agent employs a hybrid symbolic-semantic mechanism to fetch relevant SGAs and re-ground them into the current context as soft reasoning hints. Empirical results on complex benchmarks demonstrate that this paradigm enables frozen, open-weights models to match the performance of SOTA systems (e.g., GPT-5) without task-specific fine-tuning. By effectively amortizing the heavy computational cost of search, SGA-MCTS achieves System 2 reasoning depth at System 1 inference speeds, rendering autonomous planning both scalable and real-time feasible.

Anthology ID:: 2026.findings-acl.60
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1176–1193
Language:
URL:: https://aclanthology.org/2026.findings-acl.60/
DOI:
Bibkey:
Cite (ACL):: Xin Xie, Dongyun Xue, Wuguannan Yao, Mingxiao Feng, Wengang Zhou, Xiang Qi, Houqiang Li, and Peng Zhang. 2026. SGA-MCTS: Decoupling Planning from Execution via Training-Free Atomic Experience Retrieval. In Findings of the Association for Computational Linguistics: ACL 2026, pages 1176–1193, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: SGA-MCTS: Decoupling Planning from Execution via Training-Free Atomic Experience Retrieval (Xie et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.60.pdf
Checklist:: 2026.findings-acl.60.checklist.pdf

PDF Cite Search Checklist Fix data