Are We Summarizing the Right Way? A Survey of Dialogue Summarization Data Sets

Don Tuggener, Margot Mieskes, Jan Deriu, Mark Cieliebak


Abstract
Dialogue summarization is a long-standing task in the field of NLP, and several data sets with dialogues and associated human-written summaries of different styles exist. However, it is unclear for which type of dialogue which type of summary is most appropriate. For this reason, we apply a linguistic model of dialogue types to derive matching summary items and NLP tasks. This allows us to map existing dialogue summarization data sets into this model and identify gaps and potential directions for future work. As part of this process, we also provide an extensive overview of existing dialogue summarization data sets.
Anthology ID:
2021.newsum-1.12
Volume:
Proceedings of the Third Workshop on New Frontiers in Summarization
Month:
November
Year:
2021
Address:
Online and in Dominican Republic
Editors:
Giuseppe Carenini, Jackie Chi Kit Cheung, Yue Dong, Fei Liu, Lu Wang
Venue:
NewSum
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
107–118
Language:
URL:
https://aclanthology.org/2021.newsum-1.12
DOI:
10.18653/v1/2021.newsum-1.12
Bibkey:
Cite (ACL):
Don Tuggener, Margot Mieskes, Jan Deriu, and Mark Cieliebak. 2021. Are We Summarizing the Right Way? A Survey of Dialogue Summarization Data Sets. In Proceedings of the Third Workshop on New Frontiers in Summarization, pages 107–118, Online and in Dominican Republic. Association for Computational Linguistics.
Cite (Informal):
Are We Summarizing the Right Way? A Survey of Dialogue Summarization Data Sets (Tuggener et al., NewSum 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.newsum-1.12.pdf
Video:
 https://aclanthology.org/2021.newsum-1.12.mp4