Better Zero-Shot Reasoning with Self-Adaptive Prompting

Xingchen Wan; Ruoxi Sun; Hanjun Dai; Sercan O Arik; Tomas Pfister

doi:10.18653/v1/2023.findings-acl.216

Better Zero-Shot Reasoning with Self-Adaptive Prompting

Xingchen Wan, Ruoxi Sun, Hanjun Dai, Sercan Arik, Tomas Pfister

Abstract

Modern large language models (LLMs) have demonstrated impressive capabilities at sophisticated tasks, often through step-by-step reasoning similar to humans. This is made possible by their strong few- and zero-shot abilities – they can effectively learn from a handful of handcrafted, completed responses (“in-context examples”), or are prompted to reason spontaneously through specially designed triggers. Nonetheless, some limitations have been observed. First, performance in the few-shot setting is sensitive to the choice of the examples, whose design requires significant human effort. Moreover, given the diverse downstream tasks of LLMs, it may be difficult or laborious to handcraft per-task labels. Second, while the zero-shot setting does not require handcrafting, its performance is limited due to the lack of guidance to the LLMs. To address these limitations, we propose Consistency-based Self-adaptive Prompting (COSP), a novel prompt design method for LLMs. Requiring neither handcrafted responses nor ground-truth labels, COSP selects and builds the set of examples from the LLM zero-shot outputs via carefully designed criteria combining consistency, diversity and repetition. In the zero-shot setting for three different LLMs, we show that using only LLM predictions, COSP significantly improves performance up to 15% compared to zero-shot baselines and matches or exceeds few-shot baselines at a range of reasoning tasks.

Anthology ID:: 2023.findings-acl.216
Volume:: Findings of the Association for Computational Linguistics: ACL 2023
Month:: July
Year:: 2023
Address:: Toronto, Canada
Editors:: Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 3493–3514
Language:
URL:: https://aclanthology.org/2023.findings-acl.216/
DOI:: 10.18653/v1/2023.findings-acl.216
Bibkey:
Cite (ACL):: Xingchen Wan, Ruoxi Sun, Hanjun Dai, Sercan Arik, and Tomas Pfister. 2023. Better Zero-Shot Reasoning with Self-Adaptive Prompting. In Findings of the Association for Computational Linguistics: ACL 2023, pages 3493–3514, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):: Better Zero-Shot Reasoning with Self-Adaptive Prompting (Wan et al., Findings 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.findings-acl.216.pdf
Video:: https://aclanthology.org/2023.findings-acl.216.mp4

PDF Cite Search Video Fix data