Multi-Stage LLM Fine-Tuning with a Continual Learning Setting

Changhao Guan; Chao Huang; Hongliang Li; You Li (李铀); Ning Cheng; Zihe Liu; Yufeng Chen; Jinan Xu (徐金安); Jian Liu

doi:10.18653/v1/2025.findings-naacl.303

Multi-Stage LLM Fine-Tuning with a Continual Learning Setting

Changhao Guan, Chao Huang, Hongliang Li, You Li, Ning Cheng, Zihe Liu, Yufeng Chen, Jinan Xu, Jian Liu

Abstract

In recent years, large language models (LLMs) have made significant progress in knowledge-intensive applications. However, when adapting them to specific domains, we may encounter a multi-stage continuous learning scenario, especially in cases where domain knowledge evolves rapidly.This issue severely limits traditional fine-tuning approaches for LLMs.To overcome this limitation, we propose a new learning paradigm designed specifically for multi-stage continuous learning. This paradigm includes a preference-based learning bias to identify potential knowledge conflicts, as well as a self-distillation-based data augmentation strategy to expand and enrich the training corpus, thereby improving the integration of knowledge-compatible information.In the experiments, we show that our proposed method achieves a significant improvement in accuracy after 7 stages of fine-tuning compared to previous methods, while also demonstrating excellent performance in preserving general knowledge.We have released our code and dataset at Multi-Stage-Learning.

Anthology ID:: 2025.findings-naacl.303
Volume:: Findings of the Association for Computational Linguistics: NAACL 2025
Month:: April
Year:: 2025
Address:: Albuquerque, New Mexico
Editors:: Luis Chiruzzo, Alan Ritter, Lu Wang
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 5499–5513
Language:
URL:: https://aclanthology.org/2025.findings-naacl.303/
DOI:: 10.18653/v1/2025.findings-naacl.303
Bibkey:
Cite (ACL):: Changhao Guan, Chao Huang, Hongliang Li, You Li, Ning Cheng, Zihe Liu, Yufeng Chen, Jinan Xu, and Jian Liu. 2025. Multi-Stage LLM Fine-Tuning with a Continual Learning Setting. In Findings of the Association for Computational Linguistics: NAACL 2025, pages 5499–5513, Albuquerque, New Mexico. Association for Computational Linguistics.
Cite (Informal):: Multi-Stage LLM Fine-Tuning with a Continual Learning Setting (Guan et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-naacl.303.pdf

PDF Cite Search Fix data