Guideline Compliance in Task-Oriented Dialogue: The Chained Prior Approach

Xiangyu Wen; Jianyuan Zhong; Zhijian Xu; Qiang Xu

doi:10.18653/v1/2025.findings-naacl.377

Guideline Compliance in Task-Oriented Dialogue: The Chained Prior Approach

Xiangyu Wen, Jianyuan Zhong, Zhijian Xu, Qiang Xu

Abstract

Task-oriented dialogue (TOD) systems are widely used across various domains, including customer service, appointment scheduling, and technical support. In real-world scenarios, such systems must adhere to given operational guidelines. However, existing solutions based on large language models often cannot achieve strict guideline compliance, even when fine-tuned with domain knowledge. To address this issue, we introduce a novel TOD system named GuidedTOD, which explicitly considers domain-specific guidelines by integrating a policy module. This module employs a Markov Chain, termed Chained Prior, to efficiently encode and dynamically update guideline knowledge. During inference, the Chained Prior re-ranks outputs from the domain-expert language model using beam search, ensuring guideline adherence. Experimental results show that GuidedTOD significantly improves guideline compliance, achieving approximately 20% better action prediction accuracy than state-of-the-art solutions. Code is available here: https://github.com/cure-lab/GuidedTOD.

Anthology ID:: 2025.findings-naacl.377
Volume:: Findings of the Association for Computational Linguistics: NAACL 2025
Month:: April
Year:: 2025
Address:: Albuquerque, New Mexico
Editors:: Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 6765–6791
Language:
URL:: https://aclanthology.org/2025.findings-naacl.377/
DOI:: 10.18653/v1/2025.findings-naacl.377
Bibkey:
Cite (ACL):: Xiangyu Wen, Jianyuan Zhong, Zhijian Xu, and Qiang Xu. 2025. Guideline Compliance in Task-Oriented Dialogue: The Chained Prior Approach. In Findings of the Association for Computational Linguistics: NAACL 2025, pages 6765–6791, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):: Guideline Compliance in Task-Oriented Dialogue: The Chained Prior Approach (Wen et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-naacl.377.pdf

PDF Cite Search Fix data