HighRES: Highlight-based Reference-less Evaluation of Summarization

Hardy; Shashi Narayan; Andreas Vlachos

doi:10.18653/v1/P19-1330

HighRES: Highlight-based Reference-less Evaluation of Summarization

Abstract

There has been substantial progress in summarization research enabled by the availability of novel, often large-scale, datasets and recent advances on neural network-based approaches. However, manual evaluation of the system generated summaries is inconsistent due to the difficulty the task poses to human non-expert readers. To address this issue, we propose a novel approach for manual evaluation, Highlight-based Reference-less Evaluation of Summarization (HighRES), in which summaries are assessed by multiple annotators against the source document via manually highlighted salient content in the latter. Thus summary assessment on the source document by human judges is facilitated, while the highlights can be used for evaluating multiple systems. To validate our approach we employ crowd-workers to augment with highlights a recently proposed dataset and compare two state-of-the-art systems. We demonstrate that HighRES improves inter-annotator agreement in comparison to using the source document directly, while they help emphasize differences among systems that would be ignored under other evaluation approaches.

Anthology ID:: P19-1330
Volume:: Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics
Month:: July
Year:: 2019
Address:: Florence, Italy
Editors:: Anna Korhonen, David Traum, Lluís Màrquez
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 3381–3392
Language:
URL:: https://aclanthology.org/P19-1330/
DOI:: 10.18653/v1/P19-1330
Bibkey:
Cite (ACL):: Hardy, Shashi Narayan, and Andreas Vlachos. 2019. HighRES: Highlight-based Reference-less Evaluation of Summarization. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 3381–3392, Florence, Italy. Association for Computational Linguistics.
Cite (Informal):: HighRES: Highlight-based Reference-less Evaluation of Summarization (Hardy et al., ACL 2019)
Copy Citation:
PDF:: https://aclanthology.org/P19-1330.pdf
Supplementary:: P19-1330.Supplementary.pdf
Video:: https://aclanthology.org/P19-1330.mp4

PDF Cite Search Supplementary Video Fix data