PyTOD: Programmable Task-Oriented Dialogue with Execution Feedback

Alexandru Coca, Bo-Hsiang Tseng, Peter Boothroyd, Jianpeng Cheng, Zhenxing Zhang, Mark Gaynor, Joe Stacey, Tristan Guigue, Héctor Martínez Alonso, Diarmuid Ó Séaghdha, Anders Johannsen


Abstract
Programmable task-oriented dialogue (TOD) agents enable language models to follow structured dialogue policies, but their effectiveness hinges on accurate dialogue state tracking (DST). We present PyTOD, an agent that generates executable code to track dialogue state and uses policy and execution feedback for efficient error correction. To achieve this, PyTOD employs a simple constrained decoding approach, using a language model instead of grammar rules to follow API schemata. This leads to state-of-the-art DST performance on the challenging SGD benchmark. Our experiments show that PyTOD surpasses strong baselines in both accuracy and cross-turn consistency, demonstrating the effectiveness of execution-aware state tracking.
Anthology ID:
2025.sigdial-1.6
Volume:
Proceedings of the 26th Annual Meeting of the Special Interest Group on Discourse and Dialogue
Month:
August
Year:
2025
Address:
Avignon, France
Editors:
Frédéric Béchet, Fabrice Lefèvre, Nicholas Asher, Seokhwan Kim, Teva Merlin
Venue:
SIGDIAL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
93–112
Language:
URL:
https://aclanthology.org/2025.sigdial-1.6/
DOI:
Bibkey:
Cite (ACL):
Alexandru Coca, Bo-Hsiang Tseng, Peter Boothroyd, Jianpeng Cheng, Zhenxing Zhang, Mark Gaynor, Joe Stacey, Tristan Guigue, Héctor Martínez Alonso, Diarmuid Ó Séaghdha, and Anders Johannsen. 2025. PyTOD: Programmable Task-Oriented Dialogue with Execution Feedback. In Proceedings of the 26th Annual Meeting of the Special Interest Group on Discourse and Dialogue, pages 93–112, Avignon, France. Association for Computational Linguistics.
Cite (Informal):
PyTOD: Programmable Task-Oriented Dialogue with Execution Feedback (Coca et al., SIGDIAL 2025)
Copy Citation:
PDF:
https://aclanthology.org/2025.sigdial-1.6.pdf