Allocating Large Vocabulary Capacity for Cross-Lingual Language Model Pre-Training Bo Zheng author Li Dong author Shaohan Huang author Saksham Singhal author Wanxiang Che author Ting Liu author Xia Song author Furu Wei author 2021-11 text Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing Marie-Francine Moens editor Xuanjing Huang editor Lucia Specia editor Scott Wen-tau Yih editor Association for Computational Linguistics Online and Punta Cana, Dominican Republic conference publication zheng-etal-2021-allocating 10.18653/v1/2021.emnlp-main.257 https://aclanthology.org/2021.emnlp-main.257/ 2021-11 3203 3215