Learning to Create Sentence Semantic Relation Graphs for Multi-Document Summarization

Diego Antognini; Boi Faltings

doi:10.18653/v1/D19-5404

Learning to Create Sentence Semantic Relation Graphs for Multi-Document Summarization

Abstract

Linking facts across documents is a challenging task, as the language used to express the same information in a sentence can vary significantly, which complicates the task of multi-document summarization. Consequently, existing approaches heavily rely on hand-crafted features, which are domain-dependent and hard to craft, or additional annotated data, which is costly to gather. To overcome these limitations, we present a novel method, which makes use of two types of sentence embeddings: universal embeddings, which are trained on a large unrelated corpus, and domain-specific embeddings, which are learned during training. To this end, we develop SemSentSum, a fully data-driven model able to leverage both types of sentence embeddings by building a sentence semantic relation graph. SemSentSum achieves competitive results on two types of summary, consisting of 665 bytes and 100 words. Unlike other state-of-the-art models, neither hand-crafted features nor additional annotated data are necessary, and the method is easily adaptable for other tasks. To our knowledge, we are the first to use multiple sentence embeddings for the task of multi-document summarization.

Anthology ID:: D19-5404
Volume:: Proceedings of the 2nd Workshop on New Frontiers in Summarization
Month:: November
Year:: 2019
Address:: Hong Kong, China
Editors:: Lu Wang, Jackie Chi Kit Cheung, Giuseppe Carenini, Fei Liu
Venues:: NewSum | WS
SIG:: SIGSUMM
Publisher:: Association for Computational Linguistics
Note:
Pages:: 32–41
Language:
URL:: https://aclanthology.org/D19-5404/
DOI:: 10.18653/v1/D19-5404
Bibkey:
Cite (ACL):: Diego Antognini and Boi Faltings. 2019. Learning to Create Sentence Semantic Relation Graphs for Multi-Document Summarization. In Proceedings of the 2nd Workshop on New Frontiers in Summarization, pages 32–41, Hong Kong, China. Association for Computational Linguistics.
Cite (Informal):: Learning to Create Sentence Semantic Relation Graphs for Multi-Document Summarization (Antognini & Faltings, NewSum 2019)
Copy Citation:
PDF:: https://aclanthology.org/D19-5404.pdf

PDF Cite Search Fix data