WhisBERT: Multimodal Text-Audio Language Modeling on 100M Words Lukas Wolf author Klemen Kotar author Greta Tuckute author Eghbal Hosseini author Tamar I. Regev author Ethan Gotlieb Wilcox author Alexander Scott Warstadt author 2023-12 text Proceedings of the BabyLM Challenge at the 27th Conference on Computational Natural Language Learning Alex Warstadt editor Aaron Mueller editor Leshem Choshen editor Ethan Wilcox editor Chengxu Zhuang editor Juan Ciro editor Rafael Mosquera editor Bhargavi Paranjabe editor Adina Williams editor Tal Linzen editor Ryan Cotterell editor Association for Computational Linguistics Singapore conference publication wolf-etal-2023-whisbert 10.18653/v1/2023.conll-babylm.21 https://aclanthology.org/2023.conll-babylm.21/ 2023-12 253 258