Title: Human-centric dialog training via offline reinforcement learning
Authors: Natasha Jaques, Judy Hanwen Shen, Asma Ghandeharioun, Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Gu, Rosalind Picard
Date: 2020-11
Venue: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Editors: Bonnie Webber, Trevor Cohn, Yulan He, Yang Liu
Publisher: Association for Computational Linguistics
Location: Online
Type: conference publication
Identifier: jaques-etal-2020-human
DOI: 10.18653/v1/2020.emnlp-main.327
URL: https://aclanthology.org/2020.emnlp-main.327/
Pages: 3985-4003