Exploring Lottery Prompts for Pre-trained Language Models

Yulin Chen; Ning Ding; Xiaobin Wang; Shengding Hu; Haitao Zheng; Zhiyuan Liu; Pengjun Xie

doi:10.18653/v1/2023.acl-long.860

Exploring Lottery Prompts for Pre-trained Language Models

Yulin Chen, Ning Ding, Xiaobin Wang, Shengding Hu, Haitao Zheng, Zhiyuan Liu, Pengjun Xie

Abstract

Consistently scaling pre-trained language models (PLMs) imposes substantial burdens on model adaptation, necessitating more efficient alternatives to conventional fine-tuning. Given the advantage of prompting in the zero-shot setting and the observed performance fluctuation among different prompts, we explore the instance-level prompt and their generalizability.By searching through the prompt space, we first validate the assumption that for every instance, there is almost always a lottery prompt that induces the correct prediction from the PLM, and such prompt can be obtained at a low cost thanks to the inherent ability of PLMs.Meanwhile, it is shown that some strong lottery prompts have high performance over the whole training set, and they are equipped with distinguishable linguistic features. Lastly, we attempt to generalize the searched strong lottery prompts to unseen data with prompt ensembling method. Experiments are conducted on various types of NLP classification tasks and demonstrate that the proposed method can achieve comparable results with other gradient-free and optimization-free baselines.

Anthology ID:: 2023.acl-long.860
Volume:: Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2023
Address:: Toronto, Canada
Editors:: Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 15428–15444
Language:
URL:: https://aclanthology.org/2023.acl-long.860
DOI:: 10.18653/v1/2023.acl-long.860
Bibkey:
Cite (ACL):: Yulin Chen, Ning Ding, Xiaobin Wang, Shengding Hu, Haitao Zheng, Zhiyuan Liu, and Pengjun Xie. 2023. Exploring Lottery Prompts for Pre-trained Language Models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 15428–15444, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):: Exploring Lottery Prompts for Pre-trained Language Models (Chen et al., ACL 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.acl-long.860.pdf
Video:: https://aclanthology.org/2023.acl-long.860.mp4

PDF Cite Search Video