Incorporating Question Answering-Based Signals into Abstractive Summarization via Salient Span Selection

Daniel Deutsch; Dan Roth

doi:10.18653/v1/2023.eacl-main.42

Abstract

In this work, we propose a method for incorporating question-answering (QA) signals into a summarization model. Our method identifies salient noun phrases (NPs) in the input document by automatically generating wh-questions that are answered by the NPs and automatically determining whether those questions are answered in the gold summaries. This QA-based signal is incorporated into a two-stage summarization model which first marks salient NPs in the input document using a classification model, then conditionally generates a summary. Our experiments demonstrate that the models trained using QA-based supervision generate higher-quality summaries than baseline methods of identifying salient spans on benchmark summarization datasets. Further, we show that the content of the generated summaries can be controlled based on which NPs are marked in the input document. Finally, we propose a method of augmenting the training data so the gold summaries are more consistent with the marked input spans used during training and show how this results in models which learn to better exclude unmarked document content.
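The span-marking step described in the abstract can be illustrated with a small sketch. It assumes salient NPs are given as character offsets and are marked by wrapping them in special tokens; the function name `mark_salient_spans` and the `[SAL]` / `[/SAL]` tokens are hypothetical, chosen only for illustration. The abstract does not specify the paper's actual marking scheme, classifier, or generator, so none of those are modeled here.

```python
from typing import List, Tuple


def mark_salient_spans(
    document: str,
    spans: List[Tuple[int, int]],
    open_tag: str = "[SAL]",    # hypothetical marker token
    close_tag: str = "[/SAL]",  # hypothetical marker token
) -> str:
    """Wrap each (start, end) character span of `document` in marker tokens.

    Spans are half-open, must lie inside the document, and must not overlap.
    """
    pieces = []
    cursor = 0
    for start, end in sorted(spans):
        if start < cursor or start >= end or end > len(document):
            raise ValueError(f"invalid or overlapping span: ({start}, {end})")
        pieces.append(document[cursor:start])  # unmarked text before the span
        pieces.append(f"{open_tag} {document[start:end]} {close_tag}")
        cursor = end
    pieces.append(document[cursor:])  # trailing unmarked text
    return "".join(pieces)


# Toy input: in a real pipeline the spans would come from a salience classifier.
doc = "The mayor opened a new bridge on Monday."
noun_phrases = ["The mayor", "a new bridge"]
spans = [(doc.index(np), doc.index(np) + len(np)) for np in noun_phrases]
marked = mark_salient_spans(doc, spans)
print(marked)
```

Under these assumptions, the marked string would be the input to a conditional generator, and changing which spans are marked is how the summary content would be steered.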

Anthology ID: 2023.eacl-main.42
Volume: Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics
Month: May
Year: 2023
Address: Dubrovnik, Croatia
Editors: Andreas Vlachos, Isabelle Augenstein
Venue: EACL
Publisher: Association for Computational Linguistics
Pages: 575–588
URL: https://aclanthology.org/2023.eacl-main.42/
DOI: 10.18653/v1/2023.eacl-main.42
Cite (ACL): Daniel Deutsch and Dan Roth. 2023. Incorporating Question Answering-Based Signals into Abstractive Summarization via Salient Span Selection. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, pages 575–588, Dubrovnik, Croatia. Association for Computational Linguistics.
Cite (Informal): Incorporating Question Answering-Based Signals into Abstractive Summarization via Salient Span Selection (Deutsch & Roth, EACL 2023)
PDF: https://aclanthology.org/2023.eacl-main.42.pdf
Video: https://aclanthology.org/2023.eacl-main.42.mp4
