Counterfactual Off-Policy Training for Neural Dialogue Generation Qingfu Zhu author Wei-Nan Zhang author Ting Liu author William Yang Wang author 2020-11 text Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP) Bonnie Webber editor Trevor Cohn editor Yulan He editor Yang Liu editor Association for Computational Linguistics Online conference publication zhu-etal-2020-counterfactual 10.18653/v1/2020.emnlp-main.276 https://aclanthology.org/2020.emnlp-main.276/ 2020-11 3438 3448