Daniel Deutch
2023
Answering Questions by Meta-Reasoning over Multiple Chains of Thought
Ori Yoran
|
Tomer Wolfson
|
Ben Bogin
|
Uri Katz
|
Daniel Deutch
|
Jonathan Berant
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Modern systems for multi-hop question answering (QA) typically break questions into a sequence of reasoning steps, termed chain-of-thought (CoT), before arriving at a final answer. Often, multiple chains are sampled and aggregated through a voting mechanism over the final answers, but the intermediate steps themselves are discarded. While such approaches improve performance, they do not consider the relations between intermediate steps across chains and do not provide a unified explanation for the predicted answer. We introduce Multi-Chain Reasoning (MCR), an approach which prompts large language models to meta-reason over multiple chains of thought, rather than aggregate their answers. MCR examines different reasoning chains, mixes information between them and selects the most relevant facts in generating an explanation and predicting the answer. MCR outperforms strong baselines on 7 multi-hop QA datasets. Moreover, our analysis reveals that MCR explanations exhibit high quality, enabling humans to verify its answers.
2022
Weakly Supervised Text-to-SQL Parsing through Question Decomposition
Tomer Wolfson
|
Daniel Deutch
|
Jonathan Berant
Findings of the Association for Computational Linguistics: NAACL 2022
Text-to-SQL parsers are crucial in enabling non-experts to effortlessly query relational data. Training such parsers, by contrast, generally requires expertise in annotating natural language (NL) utterances with corresponding SQL queries. In this work, we propose a weak supervision approach for training text-to-SQL parsers. We take advantage of the recently proposed question meaning representation called QDMR, an intermediate between NL and formal query languages. Given questions, their QDMR structures (annotated by non-experts or automatically predicted), and the answers, we are able to automatically synthesize SQL queries that are used to train text-to-SQL models. We test our approach by experimenting on five benchmark datasets. Our results show that the weakly supervised models perform competitively with those trained on annotated NL-SQL data. Overall, we effectively train text-to-SQL parsers, while using zero SQL annotations.
2020
Break It Down: A Question Understanding Benchmark
Tomer Wolfson
|
Mor Geva
|
Ankit Gupta
|
Matt Gardner
|
Yoav Goldberg
|
Daniel Deutch
|
Jonathan Berant
Transactions of the Association for Computational Linguistics, Volume 8
Understanding natural language questions entails the ability to break down a question into the requisite steps for computing its answer. In this work, we introduce a Question Decomposition Meaning Representation (QDMR) for questions. QDMR constitutes the ordered list of steps, expressed through natural language, that are necessary for answering a question. We develop a crowdsourcing pipeline, showing that quality QDMRs can be annotated at scale, and release the Break dataset, containing over 83K pairs of questions and their QDMRs. We demonstrate the utility of QDMR by showing that (a) it can be used to improve open-domain question answering on the HotpotQA dataset, (b) it can be deterministically converted to a pseudo-SQL formal language, which can alleviate annotation in semantic parsing applications. Last, we use Break to train a sequence-to-sequence model with copying that parses questions into QDMR structures, and show that it substantially outperforms several natural baselines.
Search
Fix data
Co-authors
- Jonathan Berant 3
- Tomer Wolfson 3
- Ben Bogin 1
- Matt Gardner 1
- Mor Geva 1
- show all...