Training Trajectories of Language Models Across Scales Mengzhou Xia author Mikel Artetxe author Chunting Zhou author Xi Victoria Lin author Ramakanth Pasunuru author Danqi Chen author Luke Zettlemoyer author Veselin Stoyanov author 2023-07 text Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Anna Rogers editor Jordan Boyd-Graber editor Naoaki Okazaki editor Association for Computational Linguistics Toronto, Canada conference publication xia-etal-2023-training 10.18653/v1/2023.acl-long.767 https://aclanthology.org/2023.acl-long.767/ 2023-07 13711 13738