Multi-Stage Multi-Modal Pre-Training for Automatic Speech Recognition Yash Jain author David M Chan author Pranav Dheram author Aparna Khare author Olabanji Shonibare author Venkatesh Ravichandran author Shalini Ghosh author 2024-05 text Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024) Nicoletta Calzolari editor Min-Yen Kan editor Veronique Hoste editor Alessandro Lenci editor Sakriani Sakti editor Nianwen Xue editor ELRA and ICCL Torino, Italia conference publication jain-etal-2024-multi https://aclanthology.org/2024.lrec-main.1045/ 2024-05 11969 11980