@inproceedings{amouyal-etal-2023-qampari,
title = "{QAMPARI}: A Benchmark for Open-domain Questions with Many Answers",
author = "Amouyal, Samuel and
Wolfson, Tomer and
Rubin, Ohad and
Yoran, Ori and
Herzig, Jonathan and
Berant, Jonathan",
editor = "Gehrmann, Sebastian and
Wang, Alex and
Sedoc, Jo{\~a}o and
Clark, Elizabeth and
Dhole, Kaustubh and
Chandu, Khyathi Raghavi and
Santus, Enrico and
Sedghamiz, Hooman",
booktitle = "Proceedings of the Third Workshop on Natural Language Generation, Evaluation, and Metrics (GEM)",
month = dec,
year = "2023",
address = "Singapore",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.gem-1.9",
pages = "97--110",
abstract = "Existing benchmarks for open-domain question answering (ODQA) typically focus on questions whose answers are all in a single paragraph. By contrast, many natural questions, such as {``}What players were drafted by the Brooklyn Nets?{''} have a long list of answers extracted from multiple paragraphs. Answering such questions requires retrieving and reading many passages from a large corpus. We introduce QAMPARI, an ODQA benchmark, where answers are lists of entities, spread across many paragraphs. We created QAMPARI by (a) generating questions with multiple answers from Wikipedia{'}s knowledge graph and tables, (b) automatically pairing answers with supporting evidence in Wikipedia paragraphs, and (c) manually paraphrasing questions and validating each answer. Across a wide range of ODQA models, we find that QAMPARI is challenging in terms of both passage retrieval and answer generation, with models reaching an F1 score of 32.8 at best. We view QAMPARI as a valuable resource for ODQA research, which will aid to develop models that handle a broad range of question types, including single and multi-answer questions.",
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="amouyal-etal-2023-qampari">
<titleInfo>
<title>QAMPARI: A Benchmark for Open-domain Questions with Many Answers</title>
</titleInfo>
<name type="personal">
<namePart type="given">Samuel</namePart>
<namePart type="family">Amouyal</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Tomer</namePart>
<namePart type="family">Wolfson</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Ohad</namePart>
<namePart type="family">Rubin</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Ori</namePart>
<namePart type="family">Yoran</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jonathan</namePart>
<namePart type="family">Herzig</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jonathan</namePart>
<namePart type="family">Berant</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2023-12</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the Third Workshop on Natural Language Generation, Evaluation, and Metrics (GEM)</title>
</titleInfo>
<name type="personal">
<namePart type="given">Sebastian</namePart>
<namePart type="family">Gehrmann</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Alex</namePart>
<namePart type="family">Wang</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">João</namePart>
<namePart type="family">Sedoc</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Elizabeth</namePart>
<namePart type="family">Clark</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Kaustubh</namePart>
<namePart type="family">Dhole</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Khyathi</namePart>
<namePart type="given">Raghavi</namePart>
<namePart type="family">Chandu</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Enrico</namePart>
<namePart type="family">Santus</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Hooman</namePart>
<namePart type="family">Sedghamiz</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Singapore</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
</relatedItem>
    <abstract>Existing benchmarks for open-domain question answering (ODQA) typically focus on questions whose answers are all in a single paragraph. By contrast, many natural questions, such as “What players were drafted by the Brooklyn Nets?”, have a long list of answers extracted from multiple paragraphs. Answering such questions requires retrieving and reading many passages from a large corpus. We introduce QAMPARI, an ODQA benchmark, where answers are lists of entities, spread across many paragraphs. We created QAMPARI by (a) generating questions with multiple answers from Wikipedia’s knowledge graph and tables, (b) automatically pairing answers with supporting evidence in Wikipedia paragraphs, and (c) manually paraphrasing questions and validating each answer. Across a wide range of ODQA models, we find that QAMPARI is challenging in terms of both passage retrieval and answer generation, with models reaching an F1 score of 32.8 at best. We view QAMPARI as a valuable resource for ODQA research, which will aid the development of models that handle a broad range of question types, including single and multi-answer questions.</abstract>
<identifier type="citekey">amouyal-etal-2023-qampari</identifier>
<location>
<url>https://aclanthology.org/2023.gem-1.9</url>
</location>
<part>
<date>2023-12</date>
<extent unit="page">
<start>97</start>
<end>110</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T QAMPARI: A Benchmark for Open-domain Questions with Many Answers
%A Amouyal, Samuel
%A Wolfson, Tomer
%A Rubin, Ohad
%A Yoran, Ori
%A Herzig, Jonathan
%A Berant, Jonathan
%Y Gehrmann, Sebastian
%Y Wang, Alex
%Y Sedoc, João
%Y Clark, Elizabeth
%Y Dhole, Kaustubh
%Y Chandu, Khyathi Raghavi
%Y Santus, Enrico
%Y Sedghamiz, Hooman
%S Proceedings of the Third Workshop on Natural Language Generation, Evaluation, and Metrics (GEM)
%D 2023
%8 December
%I Association for Computational Linguistics
%C Singapore
%F amouyal-etal-2023-qampari
%X Existing benchmarks for open-domain question answering (ODQA) typically focus on questions whose answers are all in a single paragraph. By contrast, many natural questions, such as “What players were drafted by the Brooklyn Nets?”, have a long list of answers extracted from multiple paragraphs. Answering such questions requires retrieving and reading many passages from a large corpus. We introduce QAMPARI, an ODQA benchmark, where answers are lists of entities, spread across many paragraphs. We created QAMPARI by (a) generating questions with multiple answers from Wikipedia’s knowledge graph and tables, (b) automatically pairing answers with supporting evidence in Wikipedia paragraphs, and (c) manually paraphrasing questions and validating each answer. Across a wide range of ODQA models, we find that QAMPARI is challenging in terms of both passage retrieval and answer generation, with models reaching an F1 score of 32.8 at best. We view QAMPARI as a valuable resource for ODQA research, which will aid the development of models that handle a broad range of question types, including single and multi-answer questions.
%U https://aclanthology.org/2023.gem-1.9
%P 97-110
Markdown (Informal)
[QAMPARI: A Benchmark for Open-domain Questions with Many Answers](https://aclanthology.org/2023.gem-1.9) (Amouyal et al., GEM-WS 2023)

ACL
Samuel Amouyal, Tomer Wolfson, Ohad Rubin, Ori Yoran, Jonathan Herzig, and Jonathan Berant. 2023. QAMPARI: A Benchmark for Open-domain Questions with Many Answers. In Proceedings of the Third Workshop on Natural Language Generation, Evaluation, and Metrics (GEM), pages 97–110, Singapore. Association for Computational Linguistics.