Title: Pre-training LLMs using human-like development data corpus
Authors: Khushi Bhardwaj; Raj Sanjay Shah; Sashank Varma
Date: 2023-12
Type: text
Genre: conference publication
Published in: Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning
Editors: Alex Warstadt; Aaron Mueller; Leshem Choshen; Ethan Wilcox; Chengxu Zhuang; Juan Ciro; Rafael Mosquera; Bhargavi Paranjabe; Adina Williams; Tal Linzen; Ryan Cotterell
Publisher: Association for Computational Linguistics
Place: Singapore
Pages: 339-345
Identifier: bhardwaj-etal-2023-pre
DOI: 10.18653/v1/2023.conll-babylm.30
URL: https://aclanthology.org/2023.conll-babylm.30/