ReasonIF: Large Reasoning Models Fail to Follow Instructions During Reasoning

Yongchan Kwon; Shang Zhu; Federico Bianchi; Kaitlyn Zhou; James Zou

ReasonIF: Large Reasoning Models Fail to Follow Instructions During Reasoning

Yongchan Kwon, Shang Zhu, Federico Bianchi, Kaitlyn Zhou, James Zou

Abstract

The ability of large language models (LLMs) to follow user instructions is central to their reliability, safety, and usefulness. While prior studies assess instruction adherence in the model’s main responses, we argue that it is also critical for large reasoning models (LRMs) to follow user instructions throughout their reasoning process. Reasoning instruction following makes LRMs more controllable and transparent, while reducing risks of undesirable shortcuts, hallucinations, or reward hacking within reasoning traces. To evaluate this dimension, we introduce ReasonIF, a systematic benchmark for assessing reasoning instruction following. ReasonIF includes six categories of instruction prompts, spanning multilingual reasoning, and length control. Across many open-source LRMs including GPT-OSS, Qwen3, and DeepSeek-R1, we find substantial failures in reasoning instruction adherence: the highest instruction following score (IFS) remains below 0.25, meaning that fewer than 25% of reasoning traces comply with the given instructions. Notably, as task difficulty increases, reasoning instruction following degrades further. We also explore two strategies to enhance reasoning instruction fidelity: (1) multi-turn reasoning and (2) Reasoning Instruction Finetuning (RIF) using synthetic data. RIF improves the IFS of GPT-OSS-20B from 0.11 to 0.27, indicating measurable progress but leaving ample room for improvement. We hope this work draws attention to reasoning-level instruction adherence as an underexplored but critical aspect of model alignment, and helps pave the way toward more controllable, interpretable, and trustworthy reasoning models.

Anthology ID:: 2026.findings-acl.1456
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 29149–29164
Language:
URL:: https://aclanthology.org/2026.findings-acl.1456/
DOI:
Bibkey:
Cite (ACL):: Yongchan Kwon, Shang Zhu, Federico Bianchi, Kaitlyn Zhou, and James Zou. 2026. ReasonIF: Large Reasoning Models Fail to Follow Instructions During Reasoning. In Findings of the Association for Computational Linguistics: ACL 2026, pages 29149–29164, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: ReasonIF: Large Reasoning Models Fail to Follow Instructions During Reasoning (Kwon et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.1456.pdf
Checklist:: 2026.findings-acl.1456.checklist.pdf

PDF Cite Search Checklist Fix data