OFA: A Framework of Initializing Unseen Subword Embeddings for Efficient Large-scale Multilingual Continued Pretraining Yihong Liu author Peiqin Lin author Mingyang Wang author Hinrich Schuetze author 2024-06 text Findings of the Association for Computational Linguistics: NAACL 2024 Kevin Duh editor Helena Gomez editor Steven Bethard editor Association for Computational Linguistics Mexico City, Mexico conference publication liu-etal-2024-ofa 10.18653/v1/2024.findings-naacl.68 https://aclanthology.org/2024.findings-naacl.68/ 2024-06 1067 1097