Task Knowledge Injection via Interpolations and Reinstatement for Large Language Model Generalization

Yukun Zhao; Lingyong Yan; Zhenyang Li; Shuaiqiang Wang; Zhumin Chen; Zhaochun Ren; Dawei Yin

doi:10.18653/v1/2025.findings-acl.780

Abstract

Large language models have shown tremendous potential across various NLP tasks, and instruction tuning has been widely adopted to elicit their superior performance. However, instruction tuning may overly tailor the models to task-specific formats, potentially compromising their generalization on unseen tasks. We attribute the issue to the spurious correlations learned between inputs and targets. We propose explicit task knowledge injection to mitigate these shortcuts with latent task adaptation and knowledge reinstatement. Latent tasks serve as interpolations between new tasks and facilitate knowledge sharing through joint adaptation, enabling the model to build task knowledge more smoothly. Knowledge reinstatement uses prior knowledge to help the model build new knowledge. Specifically, we retrieve input-relevant latent tasks and jointly learn the task and the relevant latent tasks. Moreover, we prompt the model to recall the forms of inputs corresponding to the target and build the task knowledge through the reinstatement of prior knowledge while learning the new task. We conduct extensive experiments on state-of-the-art large language models, including Llama3.1-8B and Vicuna-13B, across 1000+ instruction-following tasks to evaluate our method. The results show that it improves generalization on both in-domain and out-of-domain unseen tasks.

Anthology ID: 2025.findings-acl.780
Volume: Findings of the Association for Computational Linguistics: ACL 2025
Month: July
Year: 2025
Address: Vienna, Austria
Editors: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 15070–15080
URL: https://aclanthology.org/2025.findings-acl.780/
DOI: 10.18653/v1/2025.findings-acl.780
Cite (ACL): Yukun Zhao, Lingyong Yan, Zhenyang Li, Shuaiqiang Wang, Zhumin Chen, Zhaochun Ren, and Dawei Yin. 2025. Task Knowledge Injection via Interpolations and Reinstatement for Large Language Model Generalization. In Findings of the Association for Computational Linguistics: ACL 2025, pages 15070–15080, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal): Task Knowledge Injection via Interpolations and Reinstatement for Large Language Model Generalization (Zhao et al., Findings 2025)
PDF: https://aclanthology.org/2025.findings-acl.780.pdf
