Coarse-to-Fine Query Focused Multi-Document Summarization

Yumo Xu, Mirella Lapata


Abstract
We consider the problem of better modeling query-cluster interactions to facilitate query focused multi-document summarization. Due to the lack of training data, existing work relies heavily on retrieval-style methods for assembling query relevant summaries. We propose a coarse-to-fine modeling framework which employs progressively more accurate modules for estimating whether text segments are relevant, likely to contain an answer, and central. The modules can be independently developed and leverage training data if available. We present an instantiation of this framework with a trained evidence estimator which relies on distant supervision from question answering (where various resources exist) to identify segments which are likely to answer the query and should be included in the summary. Our framework is robust across domains and query types (i.e., long vs short) and outperforms strong comparison systems on benchmark datasets.
Anthology ID:
2020.emnlp-main.296
Volume:
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Month:
November
Year:
2020
Address:
Online
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
3632–3645
Language:
URL:
https://aclanthology.org/2020.emnlp-main.296
DOI:
10.18653/v1/2020.emnlp-main.296
Bibkey:
Cite (ACL):
Yumo Xu and Mirella Lapata. 2020. Coarse-to-Fine Query Focused Multi-Document Summarization. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 3632–3645, Online. Association for Computational Linguistics.
Cite (Informal):
Coarse-to-Fine Query Focused Multi-Document Summarization (Xu & Lapata, EMNLP 2020)
Copy Citation:
PDF:
https://aclanthology.org/2020.emnlp-main.296.pdf
Video:
 https://slideslive.com/38938758