FastMCTS: A Simple Sampling Strategy for Data Synthesis

Peiji Li; Kai Lv; Yunfan Shao; Yichuan Ma; Linyang Li; Xiaoqing Zheng; Xipeng Qiu (邱锡鹏); Qipeng Guo

doi:10.18653/v1/2025.acl-long.1190

FastMCTS: A Simple Sampling Strategy for Data Synthesis

Peiji Li, Kai Lv, Yunfan Shao, Yichuan Ma, Linyang Li, Xiaoqing Zheng, Xipeng Qiu, Qipeng Guo

Abstract

Synthetic high-quality multi-step reasoning data can significantly enhance the performance of large language models on various tasks. However, most existing methods rely on rejection sampling, which generates trajectories independently and suffers from inefficiency and imbalanced sampling across problems of varying difficulty. In this work, we introduce FastMCTS, an innovative data synthesis strategy inspired by Monte Carlo Tree Search. FastMCTS provides a more efficient sampling method for multi-step reasoning data, offering step-level evaluation signals and promoting balanced sampling across problems of different difficulty levels. Experiments on both English and Chinese reasoning datasets demonstrate that FastMCTS generates over 30% more correct reasoning paths compared to rejection sampling as the number of generated tokens scales up. Furthermore, under comparable synthetic data budgets, models trained on FastMCTS-generated data outperform those trained on rejection sampling data by 3.9% across multiple benchmarks. As a lightweight sampling strategy, FastMCTS offers a practical and efficient alternative for synthesizing high-quality reasoning data.

Anthology ID:: 2025.acl-long.1190
Volume:: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 24405–24422
Language:
URL:: https://aclanthology.org/2025.acl-long.1190/
DOI:: 10.18653/v1/2025.acl-long.1190
Bibkey:
Cite (ACL):: Peiji Li, Kai Lv, Yunfan Shao, Yichuan Ma, Linyang Li, Xiaoqing Zheng, Xipeng Qiu, and Qipeng Guo. 2025. FastMCTS: A Simple Sampling Strategy for Data Synthesis. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 24405–24422, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: FastMCTS: A Simple Sampling Strategy for Data Synthesis (Li et al., ACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.acl-long.1190.pdf

PDF Cite Search Fix data