Composite Task-Completion Dialogue Policy Learning via Hierarchical Deep Reinforcement Learning Baolin Peng author Xiujun Li author Lihong Li author Jianfeng Gao author Asli Celikyilmaz author Sungjin Lee author Kam-Fai Wong author 2017-09 text Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing Martha Palmer editor Rebecca Hwa editor Sebastian Riedel editor Association for Computational Linguistics Copenhagen, Denmark conference publication peng-etal-2017-composite 10.18653/v1/D17-1237 https://aclanthology.org/D17-1237/ 2017-09 2231 2240