MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents

Song Feng, Siva Sankalp Patel, Hui Wan, Sachindra Joshi


Abstract
We propose MultiDoc2Dial, a new task and dataset on modeling goal-oriented dialogues grounded in multiple documents. Most previous works treat document-grounded dialogue modeling as machine reading comprehension task based on a single given document or passage. In this work, we aim to address more realistic scenarios where a goal-oriented information-seeking conversation involves multiple topics, and hence is grounded on different documents. To facilitate such task, we introduce a new dataset that contains dialogues grounded in multiple documents from four different domains. We also explore modeling the dialogue-based and document-based contexts in the dataset. We present strong baseline approaches and various experimental results, aiming to support further research efforts on such a task.
Anthology ID:
2021.emnlp-main.498
Volume:
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2021
Address:
Online and Punta Cana, Dominican Republic
Editors:
Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
6162–6176
Language:
URL:
https://aclanthology.org/2021.emnlp-main.498
DOI:
10.18653/v1/2021.emnlp-main.498
Bibkey:
Cite (ACL):
Song Feng, Siva Sankalp Patel, Hui Wan, and Sachindra Joshi. 2021. MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 6162–6176, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
Cite (Informal):
MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents (Feng et al., EMNLP 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.emnlp-main.498.pdf
Video:
 https://aclanthology.org/2021.emnlp-main.498.mp4
Code
 IBM/multidoc2dial
Data
MultiDoc2DialDoQADoc2DialNatural QuestionsQuACShARCdoc2dial