JTPRO: A Joint Tool–Prompt Reflective Optimization Framework for Language Agents

Sandip Ghoshal; Anshul Mittal; Jyotika Singh; Miguel Ballesteros; Weiyi Sun; Fang Tu; Shailender Singh; Yassine Benajiba; Fahad Shah; Sujeeth Bharadwaj; Sujith Ravi; Dan Roth

JTPRO: A Joint Tool–Prompt Reflective Optimization Framework for Language Agents

Sandip Ghoshal, Anshul Mittal, Jyotika Singh, Miguel Ballesteros, Weiyi Sun, Fang Tu, Shailender Singh, Yassine Benajiba, Fahad Shah, Sujeeth Bharadwaj, Sujith Ravi, Dan Roth

Abstract

Large language model (LLM) agents augmented with external tools often struggle as number of tools grow large and become domain-specific. In such settings, ambiguous tool descriptions and under-specified agent instructions frequently lead to tool mis-selection and incorrect slot/value instantiation. We hypothesize that this is due to two root causes: generic, one-size-fits-all prompts that ignore tool-specific nuances, and underspecified tool schemas that lack clear guidance on when and how to use each tool and how to format its parameters. We introduce Joint Tool-Prompt Reflective Optimization (JTPRO), a framework for improving tool-calling reliability in trace-supervised settings by iteratively using rollout-driven reflection to co-optimize global instructions and per-tool schema/argument descriptions for accurate tool selection and argument instantiation in large tool inventories. JTPRO is designed to preserve only tool-local cues needed for correct disambiguation and slot filling. We evaluate JTPRO across multi-tool benchmarks, which account for different number of tools using three metrics: Tool Selection Accuracy (TSA), Slot Filling Accuracy(SFA), and Overall Success Rate(OSR) (correct tool + correct slots + correct values). JTPRO consistently outperforms strong baselines, including CoT-style agents, and reflective prompt optimizers such as GEPA by 5%–20% (relative) on OSR. Ablations show that joint optimization of instructions and tool schemas is more effective and robust than optimizing either component in isolation.

Anthology ID:: 2026.findings-acl.2017
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 40573–40595
Language:
URL:: https://aclanthology.org/2026.findings-acl.2017/
DOI:
Bibkey:
Cite (ACL):: Sandip Ghoshal, Anshul Mittal, Jyotika Singh, Miguel Ballesteros, Weiyi Sun, Fang Tu, Shailender Singh, Yassine Benajiba, Fahad Shah, Sujeeth Bharadwaj, Sujith Ravi, and Dan Roth. 2026. JTPRO: A Joint Tool–Prompt Reflective Optimization Framework for Language Agents. In Findings of the Association for Computational Linguistics: ACL 2026, pages 40573–40595, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: JTPRO: A Joint Tool–Prompt Reflective Optimization Framework for Language Agents (Ghoshal et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.2017.pdf
Checklist:: 2026.findings-acl.2017.checklist.pdf

PDF Cite Search Checklist Fix data