Title: What Language Model to Train if You Have One Million GPU Hours?
Authors: Teven Le Scao, Thomas Wang, Daniel Hesslow, Stas Bekman, M Saiful Bari, Stella Biderman, Hady Elsahar, Niklas Muennighoff, Jason Phang, Ofir Press, Colin Raffel, Victor Sanh, Sheng Shen, Lintang Sutawika, Jaesung Tae, Zheng Xin Yong, Julien Launay, Iz Beltagy
Published in: Findings of the Association for Computational Linguistics: EMNLP 2022
Editors: Yoav Goldberg, Zornitsa Kozareva, Yue Zhang
Publisher: Association for Computational Linguistics
Location: Abu Dhabi, United Arab Emirates
Date: 2022-12
Resource type: text
Genre: conference publication
Pages: 765-782
Citation key: le-scao-etal-2022-language
DOI: 10.18653/v1/2022.findings-emnlp.54
URL: https://aclanthology.org/2022.findings-emnlp.54/