ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

Zhenran Xu; Yiyu Wang; Yangxue; Longyue Wang; Weihua Luo; Kaifu Zhang; Baotian Hu; Min Zhang

ComfyUI-R1: Exploring Reasoning Models for Workflow Generation

Zhenran Xu, Yiyu Wang, Yangxue, Longyue Wang, Weihua Luo, Kaifu Zhang, Baotian Hu, Min Zhang

Abstract

AI-generated content has evolved from monolithic models to modular workflows, particularly on platforms like ComfyUI, enabling customization in creative pipelines. However, crafting effective workflows requires great expertise to orchestrate numerous specialized components, presenting a steep learning curve for users. To address this challenge, we introduce ComfyUI-R1, the first large reasoning model for automated workflow generation. Starting with our curated knowledge bases, we construct long chain-of-thought (CoT) reasoning data, including node selection, workflow planning, and code-level workflow representation. ComfyUI-R1 is trained through a two-stage framework: (1) CoT fine-tuning for cold start, adapting models to the ComfyUI domain; (2) reinforcement learning for incentivizing reasoning capability, guided by a fine-grained rule-metric hybrid reward, ensuring format validity, structural integrity, and node-level fidelity. Experiments show that our 7B-parameter model achieves a 97% format validity rate, along with high pass rate, node-level and graph-level F1 scores, surpassing prior state-of-the-art methods that employ leading closed-source models such as GPT-4o and Claude series. Qualitative analysis further highlights our model’s strength in synthesizing intricate workflows with diverse nodes, aligning with human instructions, and generalizing to newly introduced nodes, underscoring the potential of long CoT reasoning in AI art creation.

Anthology ID:: 2026.findings-acl.146
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2999–3013
Language:
URL:: https://aclanthology.org/2026.findings-acl.146/
DOI:
Bibkey:
Cite (ACL):: Zhenran Xu, Yiyu Wang, Yangxue, Longyue Wang, Weihua Luo, Kaifu Zhang, Baotian Hu, and Min Zhang. 2026. ComfyUI-R1: Exploring Reasoning Models for Workflow Generation. In Findings of the Association for Computational Linguistics: ACL 2026, pages 2999–3013, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: ComfyUI-R1: Exploring Reasoning Models for Workflow Generation (Xu et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.146.pdf
Checklist:: 2026.findings-acl.146.checklist.pdf

PDF Cite Search Checklist Fix data