Open-WikiTable : Dataset for Open Domain Question Answering with Complex Reasoning over Table

Sunjun Kweon, Yeonsu Kwon, Seonhee Cho, Yohan Jo, Edward Choi


Abstract
Despite recent interest in open domain question answering (ODQA) over tables, many studies still rely on datasets that are not truly optimal for the task with respect to utilizing structural nature of table. These datasets assume answers reside as a single cell value and do not necessitate exploring over multiple cells such as aggregation, comparison, and sorting. Thus, we release Open-WikiTable, the first ODQA dataset that requires complex reasoning over tables. Open-WikiTable is built upon WikiSQL and WikiTableQuestions to be applicable in the open-domain setting. As each question is coupled with both textual answers and SQL queries, Open-WikiTable opens up a wide range of possibilities for future research, as both reader and parser methods can be applied. The dataset is publicly available.
Anthology ID:
2023.findings-acl.526
Volume:
Findings of the Association for Computational Linguistics: ACL 2023
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
8285–8297
Language:
URL:
https://aclanthology.org/2023.findings-acl.526
DOI:
10.18653/v1/2023.findings-acl.526
Bibkey:
Cite (ACL):
Sunjun Kweon, Yeonsu Kwon, Seonhee Cho, Yohan Jo, and Edward Choi. 2023. Open-WikiTable : Dataset for Open Domain Question Answering with Complex Reasoning over Table. In Findings of the Association for Computational Linguistics: ACL 2023, pages 8285–8297, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Open-WikiTable : Dataset for Open Domain Question Answering with Complex Reasoning over Table (Kweon et al., Findings 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.findings-acl.526.pdf