Conversational Word Embedding for Retrieval-Based Dialog System

Wentao Ma, Yiming Cui, Ting Liu, Dong Wang, Shijin Wang, Guoping Hu


Abstract
Human conversations contain many types of information, e.g., knowledge, common sense, and language habits. In this paper, we propose a conversational word embedding method named PR-Embedding, which utilizes the conversation pairs <post, reply> to learn word embedding. Different from previous works, PR-Embedding uses the vectors from two different semantic spaces to represent the words in post and reply. To catch the information among the pair, we first introduce the word alignment model from statistical machine translation to generate the cross-sentence window, then train the embedding on word-level and sentence-level. We evaluate the method on single-turn and multi-turn response selection tasks for retrieval-based dialog systems. The experiment results show that PR-Embedding can improve the quality of the selected response.
Anthology ID:
2020.acl-main.127
Volume:
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
Month:
July
Year:
2020
Address:
Online
Editors:
Dan Jurafsky, Joyce Chai, Natalie Schluter, Joel Tetreault
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1375–1380
Language:
URL:
https://aclanthology.org/2020.acl-main.127
DOI:
10.18653/v1/2020.acl-main.127
Bibkey:
Cite (ACL):
Wentao Ma, Yiming Cui, Ting Liu, Dong Wang, Shijin Wang, and Guoping Hu. 2020. Conversational Word Embedding for Retrieval-Based Dialog System. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 1375–1380, Online. Association for Computational Linguistics.
Cite (Informal):
Conversational Word Embedding for Retrieval-Based Dialog System (Ma et al., ACL 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.acl-main.127.pdf
Video:
 http://slideslive.com/38928772
Code
 wtma/PR-Embedding