Grounding in social media: An approach to building a chit-chat dialogue model

Ritvik Choudhary, Daisuke Kawahara


Abstract
Building open-domain dialogue systems capable of rich human-like conversational ability is one of the fundamental challenges in language generation. However, even with recent advancements in the field, existing open-domain generative models fail to capture and utilize external knowledge, leading to repetitive or generic responses to unseen utterances. Current work on knowledge-grounded dialogue generation primarily focuses on persona incorporation or searching a fact-based structured knowledge source such as Wikipedia. Our method takes a broader and simpler approach, which aims to improve the raw conversation ability of the system by mimicking the human response behavior through casual interactions found on social media. Utilizing a joint retriever-generator setup, the model queries a large set of filtered comment data from Reddit to act as additional context for the seq2seq generator. Automatic and human evaluations on open-domain dialogue datasets demonstrate the effectiveness of our approach.
Anthology ID:
2022.naacl-srw.2
Volume:
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop
Month:
July
Year:
2022
Address:
Hybrid: Seattle, Washington + Online
Editors:
Daphne Ippolito, Liunian Harold Li, Maria Leonor Pacheco, Danqi Chen, Nianwen Xue
Venue:
NAACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
9–15
Language:
URL:
https://aclanthology.org/2022.naacl-srw.2
DOI:
10.18653/v1/2022.naacl-srw.2
Bibkey:
Cite (ACL):
Ritvik Choudhary and Daisuke Kawahara. 2022. Grounding in social media: An approach to building a chit-chat dialogue model. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop, pages 9–15, Hybrid: Seattle, Washington + Online. Association for Computational Linguistics.
Cite (Informal):
Grounding in social media: An approach to building a chit-chat dialogue model (Choudhary & Kawahara, NAACL 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.naacl-srw.2.pdf
Video:
 https://aclanthology.org/2022.naacl-srw.2.mp4
Data
DailyDialogDailyDialog++