Farewell to Aimless Large-scale Pretraining: Influential Subset Selection for Language Model Xiao Wang author Weikang Zhou author Qi Zhang author Jie Zhou author SongYang Gao author Junzhe Wang author Menghan Zhang author Xiang Gao author Yun Wen Chen author Tao Gui author 2023-07 text Findings of the Association for Computational Linguistics: ACL 2023 Anna Rogers editor Jordan Boyd-Graber editor Naoaki Okazaki editor Association for Computational Linguistics Toronto, Canada conference publication wang-etal-2023-farewell 10.18653/v1/2023.findings-acl.35 https://aclanthology.org/2023.findings-acl.35/ 2023-07 555 568