DecIF: Improving Instruction-Following through Decomposition

Tingfeng Hui; Pengyu Zhu; Bowen Ping; Ling Tang; Guanting Dong; Yaqi Zhang; Sen Su

DecIF: Improving Instruction-Following through Decomposition

Tingfeng Hui, Pengyu Zhu, Bowen Ping, Ling Tang, Guanting Dong, Yaqi Zhang, Sen Su

Abstract

We propose a novel data synthesis framework, DecIF, which automatically generates accurate and diverse instruction-following data from scratch for supervised fine-tuning (SFT) and reinforcement learning (RL), leveraging large language models (LLMs) and minimal external resources. By decomposing the data synthesis pipeline into fine-grained steps, DecIF achieves meticulous quality and diversity control over generated instruction-following data. Extensive experiments across both SFT and RL demonstrate DecIF’s strong capability to flexibly synthesize accurate instruction-following data for both paradigms compared to comprehensive baselines. Further analysis demonstrates the framework’s robustness, scalability, and computational efficiency in instruction-following data generation, while its modular design ensures straightforward implementation and reproducibility.

Anthology ID:: 2026.acl-long.36
Volume:: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 848–867
Language:
URL:: https://aclanthology.org/2026.acl-long.36/
DOI:
Bibkey:
Cite (ACL):: Tingfeng Hui, Pengyu Zhu, Bowen Ping, Ling Tang, Guanting Dong, Yaqi Zhang, and Sen Su. 2026. DecIF: Improving Instruction-Following through Decomposition. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 848–867, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: DecIF: Improving Instruction-Following through Decomposition (Hui et al., ACL 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.acl-long.36.pdf
Checklist:: 2026.acl-long.36.checklist.pdf

PDF Cite Search Checklist Fix data