Evaluating the Effects of Embedding with Speaker Identity Information in Dialogue Summarization

Yuji Naraki, Tetsuya Sakai, Yoshihiko Hayashi


Abstract
Automatic dialogue summarization is a task used to succinctly summarize a dialogue transcript while correctly linking the speakers and their speech, which distinguishes this task from a conventional document summarization. To address this issue and reduce the “who said what”-related errors in a summary, we propose embedding the speaker identity information in the input embedding into the dialogue transcript encoder. Unlike the speaker embedding proposed by Gu et al. (2020), our proposal takes into account the informativeness of position embedding. By experimentally comparing several embedding methods, we confirmed that the scores of ROUGE and a human evaluation of the generated summaries were substantially increased by embedding speaker information at the less informative part of the fixed position embedding with sinusoidal functions.
Anthology ID:
2022.lrec-1.31
Volume:
Proceedings of the Thirteenth Language Resources and Evaluation Conference
Month:
June
Year:
2022
Address:
Marseille, France
Editors:
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
298–304
Language:
URL:
https://aclanthology.org/2022.lrec-1.31
DOI:
Bibkey:
Cite (ACL):
Yuji Naraki, Tetsuya Sakai, and Yoshihiko Hayashi. 2022. Evaluating the Effects of Embedding with Speaker Identity Information in Dialogue Summarization. In Proceedings of the Thirteenth Language Resources and Evaluation Conference, pages 298–304, Marseille, France. European Language Resources Association.
Cite (Informal):
Evaluating the Effects of Embedding with Speaker Identity Information in Dialogue Summarization (Naraki et al., LREC 2022)
Copy Citation:
PDF:
https://aclanthology.org/2022.lrec-1.31.pdf
Data
SAMSum