@inproceedings{mani-etal-2024-fast,
    title = "Fast Evidence Extraction for Grounded Language Model Outputs",
    author = "Mani, Pranav and
      Liang, Davis and
      Lipton, Zachary Chase",
    editor = "Schlichtkrull, Michael and
      Chen, Yulong and
      Whitehouse, Chenxi and
      Deng, Zhenyun and
      Akhtar, Mubashara and
      Aly, Rami and
      Guo, Zhijiang and
      Christodoulopoulos, Christos and
      Cocarascu, Oana and
      Mittal, Arpit and
      Thorne, James and
      Vlachos, Andreas",
    booktitle = "Proceedings of the Seventh Fact Extraction and VERification Workshop (FEVER)",
    month = nov,
    year = "2024",
    address = "Miami, Florida, USA",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.fever-1.24",
    pages = "205--218",
    abstract = "Summarizing documents with Large Language Models (LLMs) warrants a rigorous inspection of the resulting outputs by humans. However, unaided verification of generated outputs is time-intensive and intractable at scale. For high-stakes applications like healthcare where verification is necessary, expediting this step can unlock massive gains in productivity. In this paper, we focus on the task of evidence extraction for abstractive summarization: for each summary line, extract the corresponding evidence spans from a source document. Viewing this evidence extraction problem through the lens of extractive question answering, we train a set of fast and scalable hierarchical architectures: EarlyFusion, MidFusion, and LateFusion. Our experiments show that (i) our method outperforms the state-of-the-art by 1.4{\%} relative F1-Score; (ii) our model architecture reduces latency by 4x over a RoBERTa-Large baseline; and (iii) pretraining on an extractive QA corpus confers positive transfer to evidence extraction, especially in low-resource regimes.",
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
  <mods ID="mani-etal-2024-fast">
    <titleInfo>
      <title>Fast Evidence Extraction for Grounded Language Model Outputs</title>
    </titleInfo>
    <name type="personal">
      <namePart type="given">Pranav</namePart>
      <namePart type="family">Mani</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Davis</namePart>
      <namePart type="family">Liang</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Zachary</namePart>
      <namePart type="given">Chase</namePart>
      <namePart type="family">Lipton</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <originInfo>
      <dateIssued>2024-11</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
      <titleInfo>
        <title>Proceedings of the Seventh Fact Extraction and VERification Workshop (FEVER)</title>
      </titleInfo>
      <name type="personal">
        <namePart type="given">Michael</namePart>
        <namePart type="family">Schlichtkrull</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Yulong</namePart>
        <namePart type="family">Chen</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Chenxi</namePart>
        <namePart type="family">Whitehouse</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Zhenyun</namePart>
        <namePart type="family">Deng</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Mubashara</namePart>
        <namePart type="family">Akhtar</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Rami</namePart>
        <namePart type="family">Aly</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Zhijiang</namePart>
        <namePart type="family">Guo</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Christos</namePart>
        <namePart type="family">Christodoulopoulos</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Oana</namePart>
        <namePart type="family">Cocarascu</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Arpit</namePart>
        <namePart type="family">Mittal</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">James</namePart>
        <namePart type="family">Thorne</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Andreas</namePart>
        <namePart type="family">Vlachos</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <originInfo>
        <publisher>Association for Computational Linguistics</publisher>
        <place>
          <placeTerm type="text">Miami, Florida, USA</placeTerm>
        </place>
      </originInfo>
      <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>Summarizing documents with Large Language Models (LLMs) warrants a rigorous inspection of the resulting outputs by humans. However, unaided verification of generated outputs is time-intensive and intractable at scale. For high-stakes applications like healthcare where verification is necessary, expediting this step can unlock massive gains in productivity. In this paper, we focus on the task of evidence extraction for abstractive summarization: for each summary line, extract the corresponding evidence spans from a source document. Viewing this evidence extraction problem through the lens of extractive question answering, we train a set of fast and scalable hierarchical architectures: EarlyFusion, MidFusion, and LateFusion. Our experiments show that (i) our method outperforms the state-of-the-art by 1.4% relative F1-Score; (ii) our model architecture reduces latency by 4x over a RoBERTa-Large baseline; and (iii) pretraining on an extractive QA corpus confers positive transfer to evidence extraction, especially in low-resource regimes.</abstract>
    <identifier type="citekey">mani-etal-2024-fast</identifier>
    <location>
      <url>https://aclanthology.org/2024.fever-1.24</url>
    </location>
    <part>
      <date>2024-11</date>
      <extent unit="page">
        <start>205</start>
        <end>218</end>
      </extent>
    </part>
  </mods>
</modsCollection>
%0 Conference Proceedings
%T Fast Evidence Extraction for Grounded Language Model Outputs
%A Mani, Pranav
%A Liang, Davis
%A Lipton, Zachary Chase
%Y Schlichtkrull, Michael
%Y Chen, Yulong
%Y Whitehouse, Chenxi
%Y Deng, Zhenyun
%Y Akhtar, Mubashara
%Y Aly, Rami
%Y Guo, Zhijiang
%Y Christodoulopoulos, Christos
%Y Cocarascu, Oana
%Y Mittal, Arpit
%Y Thorne, James
%Y Vlachos, Andreas
%S Proceedings of the Seventh Fact Extraction and VERification Workshop (FEVER)
%D 2024
%8 November
%I Association for Computational Linguistics
%C Miami, Florida, USA
%F mani-etal-2024-fast
%X Summarizing documents with Large Language Models (LLMs) warrants a rigorous inspection of the resulting outputs by humans. However, unaided verification of generated outputs is time-intensive and intractable at scale. For high-stakes applications like healthcare where verification is necessary, expediting this step can unlock massive gains in productivity. In this paper, we focus on the task of evidence extraction for abstractive summarization: for each summary line, extract the corresponding evidence spans from a source document. Viewing this evidence extraction problem through the lens of extractive question answering, we train a set of fast and scalable hierarchical architectures: EarlyFusion, MidFusion, and LateFusion. Our experiments show that (i) our method outperforms the state-of-the-art by 1.4% relative F1-Score; (ii) our model architecture reduces latency by 4x over a RoBERTa-Large baseline; and (iii) pretraining on an extractive QA corpus confers positive transfer to evidence extraction, especially in low-resource regimes.
%U https://aclanthology.org/2024.fever-1.24
%P 205-218
Markdown (Informal)
[Fast Evidence Extraction for Grounded Language Model Outputs](https://aclanthology.org/2024.fever-1.24) (Mani et al., FEVER 2024)
ACL
Pranav Mani, Davis Liang, and Zachary Chase Lipton. 2024. Fast Evidence Extraction for Grounded Language Model Outputs. In Proceedings of the Seventh Fact Extraction and VERification Workshop (FEVER), pages 205–218, Miami, Florida, USA. Association for Computational Linguistics.