Collecting High-quality Multi-modal Conversational Search Data for E-Commerce

Marcus Collins, Oleg Rokhlenko, Eugene Agichtein, Shervin Malmasi


Abstract
Continued improvement of conversational assistants in knowledge-rich domains like E-Commerce requires large volumes of realistic high-quality conversation data to power increasingly sophisticated large language model chatbots, dialogue managers, response rankers, and recommenders. The problem is exacerbated for multi-modal interactions in realistic conversational product search and recommendation. Here, an artificial sales agent must interact intelligently with a customer using both textual and visual information and incorporate results from external search systems, such as a product catalog. Yet, it remains an open question how to best crowd-source large-scale, naturalistic multi-modal dialogue and action data, required to train such an artificial agent. We describe our crowd-sourced task where one worker (the Buyer) plays the role of the customer, and another (the Seller) plays the role of the sales agent. We identify subtle interactions between one worker’s environment and their partner’s behavior mediated by workers’ word choice. We find that limiting information presented to the Buyer, both in their backstory and by the Seller, improves conversation quality. We also show how conversations are improved through minimal automated Seller “coaching”. While typed and spoken messages are slightly different, the differences are not as large as frequently assumed. We plan to release our platform code and the resulting dialogues to advance research on conversational search agents.
Anthology ID:
2024.knowledgenlp-1.3
Volume:
Proceedings of the 3rd Workshop on Knowledge Augmented Methods for NLP
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Wenhao Yu, Weijia Shi, Michihiro Yasunaga, Meng Jiang, Chenguang Zhu, Hannaneh Hajishirzi, Luke Zettlemoyer, Zhihan Zhang
Venues:
KnowledgeNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
30–43
Language:
URL:
https://aclanthology.org/2024.knowledgenlp-1.3
DOI:
10.18653/v1/2024.knowledgenlp-1.3
Bibkey:
Cite (ACL):
Marcus Collins, Oleg Rokhlenko, Eugene Agichtein, and Shervin Malmasi. 2024. Collecting High-quality Multi-modal Conversational Search Data for E-Commerce. In Proceedings of the 3rd Workshop on Knowledge Augmented Methods for NLP, pages 30–43, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
Collecting High-quality Multi-modal Conversational Search Data for E-Commerce (Collins et al., KnowledgeNLP-WS 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.knowledgenlp-1.3.pdf