Tricks for Training Sparse Translation Models Dheeru Dua author Shruti Bhosale author Vedanuj Goswami author James Cross author Mike Lewis author Angela Fan author 2022-07 text Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies Marine Carpuat editor Marie-Catherine de Marneffe editor Ivan Vladimir Meza Ruiz editor Association for Computational Linguistics Seattle, United States conference publication dua-etal-2022-tricks 10.18653/v1/2022.naacl-main.244 https://aclanthology.org/2022.naacl-main.244/ 2022-07 3340 3345