Learning Language Specific Sub-network for Multilingual Machine Translation

Zehui Lin; Liwei Wu; Mingxuan Wang; Lei Li

doi:10.18653/v1/2021.acl-long.25

Learning Language Specific Sub-network for Multilingual Machine Translation

Zehui Lin, Liwei Wu, Mingxuan Wang, Lei Li

Abstract

Multilingual neural machine translation aims at learning a single translation model for multiple languages. These jointly trained models often suffer from performance degradationon rich-resource language pairs. We attribute this degeneration to parameter interference. In this paper, we propose LaSS to jointly train a single unified multilingual MT model. LaSS learns Language Specific Sub-network (LaSS) for each language pair to counter parameter interference. Comprehensive experiments on IWSLT and WMT datasets with various Transformer architectures show that LaSS obtains gains on 36 language pairs by up to 1.2 BLEU. Besides, LaSS shows its strong generalization performance at easy adaptation to new language pairs and zero-shot translation. LaSS boosts zero-shot translation with an average of 8.3 BLEU on 30 language pairs. Codes and trained models are available at https://github.com/NLP-Playground/LaSS.

Anthology ID:: 2021.acl-long.25
Volume:: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
Month:: August
Year:: 2021
Address:: Online
Editors:: Chengqing Zong, Fei Xia, Wenjie Li, Roberto Navigli
Venues:: ACL | IJCNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 293–305
Language:
URL:: https://aclanthology.org/2021.acl-long.25/
DOI:: 10.18653/v1/2021.acl-long.25
Bibkey:
Cite (ACL):: Zehui Lin, Liwei Wu, Mingxuan Wang, and Lei Li. 2021. Learning Language Specific Sub-network for Multilingual Machine Translation. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 293–305, Online. Association for Computational Linguistics.
Cite (Informal):: Learning Language Specific Sub-network for Multilingual Machine Translation (Lin et al., ACL-IJCNLP 2021)
Copy Citation:
PDF:: https://aclanthology.org/2021.acl-long.25.pdf
Video:: https://aclanthology.org/2021.acl-long.25.mp4

PDF Cite Search Video Fix data