Mid-Think: Training-Free Intermediate-Budget Reasoning via Token-Level Triggers

Van Yang; Shouren Wang; Debargha Ganguly; Xinpeng Li; Chaoda Song; Vikash Singh; Vipin Chaudhary; Xiaotian Han

Abstract

Reasoning language models are controlled through explicit modes such as Think and No-think, yet we find that these behaviors are largely governed by a few token-level triggers rather than high-level instructions. Through attention analysis and controlled prompting experiments, we show that a leading “Okay” token induces reasoning behavior, while the newline pattern following “</think>” suppresses it. Based on this observation, we propose Mid-Think, a simple training-free prompting format that combines these triggers to achieve intermediate-budget reasoning, consistently outperforming fixed-token and prompt-based baselines in terms of the accuracy–length trade-off. Furthermore, applying Mid-Think to RL training after SFT reduces training time by approximately 15% while improving final performance of Qwen3-8B on AIME from 69.8% to 72.4% and on GPQA from 58.5% to 61.1%, demonstrating its effectiveness for both inference-time control and RL-based reasoning training.
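The abstract names two triggers (a leading “Okay” token and the newline pattern after “</think>”) but does not spell out the exact Mid-Think template. The sketch below only illustrates the general idea of steering a model through the prefix of its response rather than through instructions. Every string and name in it (`build_prefix`, the three mode labels, and in particular the `mid_think` arrangement) is an assumption made for illustration, not the authors' format.

```python
# Illustrative only: the prefixes below are assumptions, not the paper's templates.
THINK_OPEN = "<think>\n"
THINK_CLOSE = "</think>\n\n"  # close tag followed by the newline pattern


def build_prefix(mode: str) -> str:
    """Return a hypothetical response prefix to prepend before generation."""
    if mode == "think":
        # Open the reasoning block and let the model continue freely.
        return THINK_OPEN
    if mode == "no_think":
        # An empty reasoning block, closed immediately.
        return THINK_OPEN + "\n" + THINK_CLOSE
    if mode == "mid_think":
        # Assumed combination of both triggers: the closed block (suppressing
        # trigger) followed by a leading "Okay" (inducing trigger).
        return THINK_OPEN + "\n" + THINK_CLOSE + "Okay"
    raise ValueError(f"unknown mode: {mode!r}")


if __name__ == "__main__":
    for mode in ("think", "no_think", "mid_think"):
        print(mode, repr(build_prefix(mode)))
```

In practice such a prefix would be appended to the rendered chat prompt before calling the generation routine; which arrangement actually yields intermediate-length reasoning is an empirical question that the paper itself addresses.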

Anthology ID: 2026.findings-acl.299
Volume: Findings of the Association for Computational Linguistics: ACL 2026
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 6024–6038
URL: https://aclanthology.org/2026.findings-acl.299/
Cite (ACL): Van Yang, Shouren Wang, Debargha Ganguly, Xinpeng Li, Chaoda Song, Vikash Singh, Vipin Chaudhary, and Xiaotian Han. 2026. Mid-Think: Training-Free Intermediate-Budget Reasoning via Token-Level Triggers. In Findings of the Association for Computational Linguistics: ACL 2026, pages 6024–6038, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): Mid-Think: Training-Free Intermediate-Budget Reasoning via Token-Level Triggers (Yang et al., Findings 2026)
PDF: https://aclanthology.org/2026.findings-acl.299.pdf
Checklist: 2026.findings-acl.299.checklist.pdf
