Cross-Lingual Transfer with Target Language-Ready Task Adapters

Marinela Parović; Alan Ansell; Ivan Vulić; Anna Korhonen

doi:10.18653/v1/2023.findings-acl.13

Cross-Lingual Transfer with Target Language-Ready Task Adapters

Marinela Parovic, Alan Ansell, Ivan Vulić, Anna Korhonen

Abstract

Adapters have emerged as a modular and parameter-efficient approach to (zero-shot) cross-lingual transfer. The established MAD-X framework employs separate language and task adapters which can be arbitrarily combined to perform the transfer of any task to any target language. Subsequently, BAD-X, an extension of the MAD-X framework, achieves improved transfer at the cost of MAD-X’s modularity by creating ‘bilingual’ adapters specific to the source-target language pair. In this work, we aim to take the best of both worlds by (i) fine-tuning *task* adapters adapted to the target language(s) (so-called *‘target language-ready’ (TLR)* adapters) to maintain high transfer performance, but (ii) without sacrificing the highly modular design of MAD-X. The main idea of ‘target language-ready’ adapters is to resolve the training-vs-inference discrepancy of MAD-X: the task adapter ‘sees’ the target language adapter for the very first time during inference, and thus might not be fully compatible with it. We address this mismatch by exposing the task adapter to the target language adapter during training, and empirically validate several variants of the idea: in the simplest form, we alternate between using the source and target language adapters during task adapter training, which can be generalized to cycling over any set of language adapters. We evaluate different TLR-based transfer configurations with varying degrees of generality across a suite of standard cross-lingual benchmarks, and find that the most general (and thus most modular) configuration consistently outperforms MAD-X and BAD-X on most tasks and languages.

Anthology ID:: 2023.findings-acl.13
Volume:: Findings of the Association for Computational Linguistics: ACL 2023
Month:: July
Year:: 2023
Address:: Toronto, Canada
Editors:: Anna Rogers, Jordan Boyd-Graber, Naoaki Okazaki
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 176–193
Language:
URL:: https://aclanthology.org/2023.findings-acl.13
DOI:: 10.18653/v1/2023.findings-acl.13
Bibkey:
Cite (ACL):: Marinela Parovic, Alan Ansell, Ivan Vulić, and Anna Korhonen. 2023. Cross-Lingual Transfer with Target Language-Ready Task Adapters. In Findings of the Association for Computational Linguistics: ACL 2023, pages 176–193, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):: Cross-Lingual Transfer with Target Language-Ready Task Adapters (Parovic et al., Findings 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.findings-acl.13.pdf

PDF Cite Search