Instance-Level Dynamic LoRAs Composition for Cross-Task Generalization

Zhiqi Wang, Shizhu He, Kang Liu, Jun Zhao


Abstract
Large language models perform well on tasks seen during instruction fine-tuning, but their performance on completely unseen tasks is often less than ideal. To address this cross-task generalization problem, task-level LoRA combination has been proposed; it does not require training a model for each new task. Instead, it learns combination weights for existing LoRA modules from a small number of samples and uses the weighted combination as the task model. However, task-level LoRA combination uses only a few task modules because it relies on a weight enumeration method, and it ignores the differences between individual instances. We therefore propose an instance-level LoRA composition method for cross-task generalization, which selects multiple appropriate task LoRA modules for each input instance and dynamically determines their composition weights. Our experiments on publicly available datasets show that our method outperforms LoraHub, a representative task-level method, on 16 out of 27 tasks. We release the source code at https://github.com/noname822/iLoraComp.git
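The abstract does not say how modules are selected or how the composition weights are computed, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes that each LoRA module has a hypothetical "key" vector in the same space as an instance embedding, that selection is by cosine similarity (top-k), and that weights come from a softmax over the selected similarities. The function name `compose_lora_for_instance` and all array shapes are illustrative assumptions.

```python
import numpy as np


def compose_lora_for_instance(instance_vec, module_keys, lora_A, lora_B, top_k=2):
    """Pick top_k LoRA modules for one instance and merge their updates.

    Assumed shapes (hypothetical, for illustration only):
      instance_vec: (d_emb,)            embedding of the input instance
      module_keys:  (n_modules, d_emb)  one key vector per LoRA module
      lora_A:       (n_modules, rank, d_in)
      lora_B:       (n_modules, d_out, rank)
    """
    # Cosine similarity between the instance and every module key.
    inst = instance_vec / np.linalg.norm(instance_vec)
    keys = module_keys / np.linalg.norm(module_keys, axis=1, keepdims=True)
    sims = keys @ inst

    # Instance-level selection: indices of the top_k most similar modules.
    idx = np.argsort(-sims)[:top_k]

    # Dynamic weights: softmax over the selected similarities (an assumption).
    logits = sims[idx] - sims[idx].max()
    weights = np.exp(logits) / np.exp(logits).sum()

    # Weighted sum of the low-rank updates: delta_W = sum_i w_i * B_i @ A_i.
    delta_W = sum(w * (lora_B[i] @ lora_A[i]) for w, i in zip(weights, idx))
    return idx, weights, delta_W


# Toy data: 5 random modules; the instance coincides with module 3's key.
rng = np.random.default_rng(0)
n_modules, d_in, d_out, rank, d_emb = 5, 8, 6, 2, 4
module_keys = rng.normal(size=(n_modules, d_emb))
lora_A = rng.normal(size=(n_modules, rank, d_in))
lora_B = rng.normal(size=(n_modules, d_out, rank))
instance_vec = module_keys[3].copy()

idx, weights, delta_W = compose_lora_for_instance(
    instance_vec, module_keys, lora_A, lora_B, top_k=2
)
```

In this sketch a different instance embedding yields a different module subset and different weights, which is the contrast with task-level combination, where one weight vector is fixed for the whole task.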
Anthology ID:
2024.findings-emnlp.326
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2024
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
5699–5708
URL:
https://aclanthology.org/2024.findings-emnlp.326
Cite (ACL):
Zhiqi Wang, Shizhu He, Kang Liu, and Jun Zhao. 2024. Instance-Level Dynamic LoRAs Composition for Cross-Task Generalization. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 5699–5708, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Instance-Level Dynamic LoRAs Composition for Cross-Task Generalization (Wang et al., Findings 2024)
PDF:
https://aclanthology.org/2024.findings-emnlp.326.pdf