How Robust is Neural Machine Translation to Language Imbalance in Multilingual Tokenizer Training? Shiyue Zhang author Vishrav Chaudhary author Naman Goyal author James Cross author Guillaume Wenzek author Mohit Bansal author Francisco Guzman author 2022-09 text Proceedings of the 15th biennial conference of the Association for Machine Translation in the Americas (Volume 1: Research Track) Kevin Duh editor Francisco Guzmán editor Association for Machine Translation in the Americas Orlando, USA conference publication zhang-etal-2022-robust https://aclanthology.org/2022.amta-research.8/ 2022-09 97 116