A Teacher-Student Framework for Maintainable Dialog Manager

Weikang Wang, Jiajun Zhang, Han Zhang, Mei-Yuh Hwang, Chengqing Zong, Zhifei Li


Abstract
Reinforcement learning (RL) is an attractive solution for task-oriented dialog systems. However, extending RL-based systems to handle new intents and slots requires a system redesign. The high maintenance cost makes it difficult to apply RL methods to practical systems on a large scale. To address this issue, we propose a practical teacher-student framework to extend RL-based dialog systems without retraining from scratch. Specifically, the “student” is an extended dialog manager based on a new ontology, and the “teacher” is existing resources used for guiding the learning process of the “student”. By specifying constraints held in the new dialog manager, we transfer knowledge of the “teacher” to the “student” without additional resources. Experiments show that the performance of the extended system is comparable to the system trained from scratch. More importantly, the proposed framework makes no assumption about the unsupported intents and slots, which makes it possible to improve RL-based systems incrementally.
Anthology ID:
D18-1415
Volume:
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing
Month:
October-November
Year:
2018
Address:
Brussels, Belgium
Editors:
Ellen Riloff, David Chiang, Julia Hockenmaier, Jun’ichi Tsujii
Venue:
EMNLP
SIG:
SIGDAT
Publisher:
Association for Computational Linguistics
Note:
Pages:
3803–3812
Language:
URL:
https://aclanthology.org/D18-1415
DOI:
10.18653/v1/D18-1415
Bibkey:
Cite (ACL):
Weikang Wang, Jiajun Zhang, Han Zhang, Mei-Yuh Hwang, Chengqing Zong, and Zhifei Li. 2018. A Teacher-Student Framework for Maintainable Dialog Manager. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing, pages 3803–3812, Brussels, Belgium. Association for Computational Linguistics.
Cite (Informal):
A Teacher-Student Framework for Maintainable Dialog Manager (Wang et al., EMNLP 2018)
Copy Citation:
PDF:
https://aclanthology.org/D18-1415.pdf
Attachment:
 D18-1415.Attachment.zip