Title: Mini But Mighty: Efficient Multilingual Pretraining with Linguistically-Informed Data Selection
Authors: Tolúlọpẹ́ Ògúnrẹ̀mí, Dan Jurafsky, Christopher D Manning
Date: 2023-05
Type: text (conference publication)
Published in: Findings of the Association for Computational Linguistics: EACL 2023
Editors: Andreas Vlachos, Isabelle Augenstein
Publisher: Association for Computational Linguistics
Location: Dubrovnik, Croatia
Pages: 1251-1266
Citation key: ogunremi-etal-2023-mini
DOI: 10.18653/v1/2023.findings-eacl.93
URL: https://aclanthology.org/2023.findings-eacl.93/