A Lightweight Mixture-of-Experts Neural Machine Translation Model with Stage-wise Training Strategy Fan Zhang author Mei Tu author Song Liu author Jinyao Yan author 2024-06 text Findings of the Association for Computational Linguistics: NAACL 2024 Kevin Duh editor Helena Gomez editor Steven Bethard editor Association for Computational Linguistics Mexico City, Mexico conference publication zhang-etal-2024-lightweight 10.18653/v1/2024.findings-naacl.154 https://aclanthology.org/2024.findings-naacl.154/ 2024-06 2381 2392