Exploring Two-Phase Continual Instruction Fine-tuning for Multilingual Adaptation in Large Language Models

Divyanshu Aggarwal; Sankarshan Damle; Navin Goyal; Satya Lokam; Sunayana Sitaram

Exploring Two-Phase Continual Instruction Fine-tuning for Multilingual Adaptation in Large Language Models

Divyanshu Aggarwal, Sankarshan Damle, Navin Goyal, Satya Lokam, Sunayana Sitaram

Abstract

A key challenge for Large Language Models (LLMs) is improving their Multilingual instruction-following ability over time without deteriorating their ability in languages they already excel at, typically English. In this paper, we study a two-phase Continual Fine-tuning (CFT) setup toward improving a model’s Multilingual adaptability. Concretely, we consider a two-phase CFT process in which an English-only end-to-end instruction fine-tuned LLM (Phase 1) is sequentially fine-tuned on a multilingual instruction dataset (Phase 2). Across MISTRAL-7B and LLAMA-3-8B and multiple dataset pairs, we show that instructional similarity between phases is critical: aligned datasets preserve or improve English while boosting multilingual ability, whereas misaligned datasets cause English degradation. We show that this degradation arises from representation shift during CFT, and that targeted mitigation strategies, including generative replay and heuristic-based layer freezing, reduce this shift and improve multilingual adaptation.

Anthology ID:: 2026.findings-acl.1595
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 31882–31904
Language:
URL:: https://aclanthology.org/2026.findings-acl.1595/
DOI:
Bibkey:
Cite (ACL):: Divyanshu Aggarwal, Sankarshan Damle, Navin Goyal, Satya Lokam, and Sunayana Sitaram. 2026. Exploring Two-Phase Continual Instruction Fine-tuning for Multilingual Adaptation in Large Language Models. In Findings of the Association for Computational Linguistics: ACL 2026, pages 31882–31904, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Exploring Two-Phase Continual Instruction Fine-tuning for Multilingual Adaptation in Large Language Models (Aggarwal et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.1595.pdf
Checklist:: 2026.findings-acl.1595.checklist.pdf

PDF Cite Search Checklist Fix data