ProtoCycle: Reflective Tool-Augmented Planning for Text-Guided Protein Design

Yutang Ge; Guojiang Zhao; Sihang Li; Zheng Cheng; Zifeng Zhao; Hanchen Xia; Guolin Ke; Linfeng Zhang; Zhifeng Gao; Yu Guang Wang

ProtoCycle: Reflective Tool-Augmented Planning for Text-Guided Protein Design

Yutang Ge, Guojiang Zhao, Sihang Li, Zheng Cheng, Zifeng Zhao, Hanchen Xia, Guolin Ke, Linfeng Zhang, Zhifeng Gao, Yu Guang Wang

Abstract

Designing proteins that satisfy natural language functional requirements is a central goal in protein engineering. A straightforward baseline is to fine-tune generic instruction-tuned LLMs as direct text-to-sequence generators, but this is data- and compute-hungry. With limited supervision, LLMs can produce coherent plans in text yet fail to reliably realize them as sequences. This plan–execute gap motivates ProtoCycle, an agentic framework for protein design that uses LLMs primarily to drive a multi-round, feedback-driven decision cycle. ProtoCycle couples an LLM planner with a lightweight tool environment designed to emulate the iterative workflow of human protein engineers and uses LLM-driven reflection on tool feedback to revise plans. Trained with supervised trajectories and online reinforcement learning, ProtoCycle achieves strong language alignment while maintaining competitive foldability, and ablations show that reflection substantially improves sequence quality.

Anthology ID:: 2026.findings-acl.763
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 15562–15586
Language:
URL:: https://aclanthology.org/2026.findings-acl.763/
DOI:
Bibkey:
Cite (ACL):: Yutang Ge, Guojiang Zhao, Sihang Li, Zheng Cheng, Zifeng Zhao, Hanchen Xia, Guolin Ke, Linfeng Zhang, Zhifeng Gao, and Yu Guang Wang. 2026. ProtoCycle: Reflective Tool-Augmented Planning for Text-Guided Protein Design. In Findings of the Association for Computational Linguistics: ACL 2026, pages 15562–15586, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: ProtoCycle: Reflective Tool-Augmented Planning for Text-Guided Protein Design (Ge et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.763.pdf
Checklist:: 2026.findings-acl.763.checklist.pdf

PDF Cite Search Checklist Fix data