S4-Tuning: A Simple Cross-lingual Sub-network Tuning Method

Runxin Xu; Fuli Luo; Baobao Chang (常宝宝); Songfang Huang; Fei Huang

doi:10.18653/v1/2022.acl-short.58

S⁴-Tuning: A Simple Cross-lingual Sub-network Tuning Method

Runxin Xu, Fuli Luo, Baobao Chang, Songfang Huang, Fei Huang

Abstract

The emergence of multilingual pre-trained language models makes it possible to adapt to target languages with only few labeled examples. However, vanilla fine-tuning tends to achieve degenerated and unstable results, owing to the Language Interference among different languages, and Parameter Overload under the few-sample transfer learning scenarios. To address two problems elegantly, we propose S⁴-Tuning, a Simple Cross-lingual Sub-network Tuning method. S⁴-Tuning first detects the most essential sub-network for each target language, and only updates it during fine-tuning.In this way, the language sub-networks lower the scale of trainable parameters, and hence better suit the low-resource scenarios.Meanwhile, the commonality and characteristics across languages are modeled by the overlapping and non-overlapping parts to ease the interference among languages.Simple but effective, S⁴-Tuning gains consistent improvements over vanilla fine-tuning on three multi-lingual tasks involving 37 different languages in total (XNLI, PAWS-X, and Tatoeba).

Anthology ID:: 2022.acl-short.58
Volume:: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Month:: May
Year:: 2022
Address:: Dublin, Ireland
Editors:: Smaranda Muresan, Preslav Nakov, Aline Villavicencio
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 530–537
Language:
URL:: https://aclanthology.org/2022.acl-short.58/
DOI:: 10.18653/v1/2022.acl-short.58
Bibkey:
Cite (ACL):: Runxin Xu, Fuli Luo, Baobao Chang, Songfang Huang, and Fei Huang. 2022. S4-Tuning: A Simple Cross-lingual Sub-network Tuning Method. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 530–537, Dublin, Ireland. Association for Computational Linguistics.
Cite (Informal):: S4-Tuning: A Simple Cross-lingual Sub-network Tuning Method (Xu et al., ACL 2022)
Copy Citation:
PDF:: https://aclanthology.org/2022.acl-short.58.pdf

PDF Cite Search Fix data