Team Papelo at FEVEROUS: Multi-hop Evidence Pursuit

Christopher Malon


Abstract
We develop a system for the FEVEROUS fact extraction and verification task that ranks an initial set of potential evidence and then pursues missing evidence in subsequent hops by trying to generate it, with a “next hop prediction module” whose output is matched against page elements in a predicted article. Seeking evidence with the next hop prediction module continues to improve FEVEROUS score for up to seven hops. Label classification is trained on possibly incomplete extracted evidence chains, utilizing hints that facilitate numerical comparison. The system achieves .281 FEVEROUS score and .658 label accuracy on the development set, and finishes in second place with .259 FEVEROUS score and .576 label accuracy on the test set.
Anthology ID:
2021.fever-1.5
Volume:
Proceedings of the Fourth Workshop on Fact Extraction and VERification (FEVER)
Month:
November
Year:
2021
Address:
Dominican Republic
Venues:
EMNLP | FEVER
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
40–49
Language:
URL:
https://aclanthology.org/2021.fever-1.5
DOI:
Bibkey:
Copy Citation:
PDF:
https://aclanthology.org/2021.fever-1.5.pdf
Data
FEVERFEVEROUS