Task-Completion Dialogue Policy Learning via Monte Carlo Tree Search with Dueling Network Sihan Wang author Kaijie Zhou author Kunfeng Lai author Jianping Shen author 2020-11 text Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) Bonnie Webber editor Trevor Cohn editor Yulan He editor Yang Liu editor Association for Computational Linguistics Online conference publication wang-etal-2020-task 10.18653/v1/2020.emnlp-main.278 https://aclanthology.org/2020.emnlp-main.278/ 2020-11 3461 3471