Tailored Sequence to Sequence Models to Different Conversation Scenarios

Hainan Zhang; Yanyan Lan; Jiafeng Guo (嘉丰 郭); Jun Xu; Xueqi Cheng (程学旗)

doi:10.18653/v1/P18-1137

Tailored Sequence to Sequence Models to Different Conversation Scenarios

Hainan Zhang, Yanyan Lan, Jiafeng Guo, Jun Xu, Xueqi Cheng

Abstract

Sequence to sequence (Seq2Seq) models have been widely used for response generation in the area of conversation. However, the requirements for different conversation scenarios are distinct. For example, customer service requires the generated responses to be specific and accurate, while chatbot prefers diverse responses so as to attract different users. The current Seq2Seq model fails to meet these diverse requirements, by using a general average likelihood as the optimization criteria. As a result, it usually generates safe and commonplace responses, such as ‘I don’t know’. In this paper, we propose two tailored optimization criteria for Seq2Seq to different conversation scenarios, i.e., the maximum generated likelihood for specific-requirement scenario, and the conditional value-at-risk for diverse-requirement scenario. Experimental results on the Ubuntu dialogue corpus (Ubuntu service scenario) and Chinese Weibo dataset (social chatbot scenario) show that our proposed models not only satisfies diverse requirements for different scenarios, but also yields better performances against traditional Seq2Seq models in terms of both metric-based and human evaluations.

Anthology ID:: P18-1137
Volume:: Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2018
Address:: Melbourne, Australia
Editors:: Iryna Gurevych, Yusuke Miyao
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 1479–1488
Language:
URL:: https://aclanthology.org/P18-1137/
DOI:: 10.18653/v1/P18-1137
Bibkey:
Cite (ACL):: Hainan Zhang, Yanyan Lan, Jiafeng Guo, Jun Xu, and Xueqi Cheng. 2018. Tailored Sequence to Sequence Models to Different Conversation Scenarios. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1479–1488, Melbourne, Australia. Association for Computational Linguistics.
Cite (Informal):: Tailored Sequence to Sequence Models to Different Conversation Scenarios (Zhang et al., ACL 2018)
Copy Citation:
PDF:: https://aclanthology.org/P18-1137.pdf
Poster:: P18-1137.Poster.pdf

PDF Cite Search Poster Fix data