SEER : A Knapsack approach to Exemplar Selection for In-Context HybridQA

Jonathan Tonglet, Manon Reusens, Philipp Borchert, Bart Baesens


Abstract
Question answering over hybrid contexts is a complex task, which requires the combination of information extracted from unstructured texts and structured tables in various ways. Recently, In-Context Learning demonstrated significant performance advances for reasoning tasks. In this paradigm, a large language model performs predictions based on a small set of supporting exemplars. The performance of In-Context Learning depends heavily on the selection procedure of the supporting exemplars, particularly in the case of HybridQA, where considering the diversity of reasoning chains and the large size of the hybrid contexts becomes crucial. In this work, we present Selection of ExEmplars for hybrid Reasoning (SEER), a novel method for selecting a set of exemplars that is both representative and diverse. The key novelty of SEER is that it formulates exemplar selection as a Knapsack Integer Linear Program. The Knapsack framework provides the flexibility to incorporate diversity constraints that prioritize exemplars with desirable attributes, and capacity constraints that ensure that the prompt size respects the provided capacity budgets. The effectiveness of SEER is demonstrated on FinQA and TAT-QA, two real-world benchmarks for HybridQA, where it outperforms previous exemplar selection methods.
Anthology ID:
2023.emnlp-main.837
Volume:
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Month:
December
Year:
2023
Address:
Singapore
Editors:
Houda Bouamor, Juan Pino, Kalika Bali
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
13569–13583
Language:
URL:
https://aclanthology.org/2023.emnlp-main.837
DOI:
10.18653/v1/2023.emnlp-main.837
Bibkey:
Cite (ACL):
Jonathan Tonglet, Manon Reusens, Philipp Borchert, and Bart Baesens. 2023. SEER : A Knapsack approach to Exemplar Selection for In-Context HybridQA. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 13569–13583, Singapore. Association for Computational Linguistics.
Cite (Informal):
SEER : A Knapsack approach to Exemplar Selection for In-Context HybridQA (Tonglet et al., EMNLP 2023)
Copy Citation:
PDF:
https://aclanthology.org/2023.emnlp-main.837.pdf
Video:
 https://aclanthology.org/2023.emnlp-main.837.mp4