Few-shot Adaptation Works with UnpredicTable Data

Jun Shern Chan, Michael Pieler, Jonathan Jao, Jérémy Scheurer, Ethan Perez


Abstract
Prior work on language models (LMs) shows that training on a large number of diverse tasks improves few-shot learning (FSL) performance on new tasks. We take this to the extreme, automatically extracting 413,299 tasks from internet tables, orders of magnitude more than the next-largest public datasets. Finetuning on the resulting dataset improves FSL performance on Natural Language Processing (NLP) tasks, but not proportionally to dataset scale. In fact, we find that narrow subsets of our dataset sometimes outperform more diverse datasets. For example, finetuning on software documentation from support.google.com raises FSL performance by a mean of +7.5% across 52 downstream tasks, beating training on 40 human-curated NLP datasets (+6.7%). Finetuning on various narrow datasets leads to similarly broad improvements across test tasks, suggesting that the gains come not from domain adaptation but from adapting to FSL in general. We do not observe clear patterns among the datasets that lead to FSL gains, leaving open questions about why certain data helps with FSL.
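To illustrate the table-to-task idea the abstract describes, here is a minimal Python sketch of one plausible conversion: each table becomes one few-shot task, each row becomes an (input, output) example, and one column is held out as the answer. The function names, prompt template, and column-selection rule are illustrative assumptions, not the paper's exact pipeline.

```python
import random
from typing import Optional

def table_to_task(rows: list, output_col: Optional[str] = None) -> list:
    """Turn a parsed table (a list of {column: cell} dicts) into
    (input, output) examples. One column is chosen as the answer;
    the remaining columns are serialized into the prompt input.
    NOTE: hypothetical sketch, not the paper's exact format."""
    columns = list(rows[0].keys())
    output_col = output_col or random.choice(columns)
    input_cols = [c for c in columns if c != output_col]
    examples = []
    for row in rows:
        prompt = " ".join(f"[{c}] {row[c]}" for c in input_cols)
        examples.append({"input": prompt, "output": row[output_col]})
    return examples

def make_few_shot_prompt(examples: list, k: int = 3) -> str:
    """Concatenate k support examples plus one query row,
    in the usual in-context-learning style."""
    support, query = examples[:k], examples[k]
    shots = "\n".join(f"{ex['input']} -> {ex['output']}" for ex in support)
    return f"{shots}\n{query['input']} ->"

# Usage: a tiny software-documentation-style table.
rows = [
    {"Feature": "Dark mode", "Setting": "Display", "Default": "Off"},
    {"Feature": "Autosave", "Setting": "Editor", "Default": "On"},
    {"Feature": "Spell check", "Setting": "Editor", "Default": "On"},
    {"Feature": "Sync", "Setting": "Account", "Default": "Off"},
]
print(make_few_shot_prompt(table_to_task(rows, output_col="Default")))
```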
Anthology ID:
2023.acl-long.102
Volume:
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
1806–1842
URL:
https://aclanthology.org/2023.acl-long.102
DOI:
10.18653/v1/2023.acl-long.102
Cite (ACL):
Jun Shern Chan, Michael Pieler, Jonathan Jao, Jérémy Scheurer, and Ethan Perez. 2023. Few-shot Adaptation Works with UnpredicTable Data. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1806–1842, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
Few-shot Adaptation Works with UnpredicTable Data (Chan et al., ACL 2023)
PDF:
https://aclanthology.org/2023.acl-long.102.pdf
Video:
https://aclanthology.org/2023.acl-long.102.mp4