MergeIT: From Selection to Merging for Efficient Instruction Tuning

Hongyi Cai; Yuqian Fu; Hongming Fu; Bo Zhao

Abstract

Instruction tuning is crucial for optimizing Large Language Models (LLMs), as the quality and diversity of instructional data significantly influence model performance. This naturally underscores the importance of an effective and efficient data selection strategy. However, recent mainstream data selection methods typically rely on LLMs to score instruction quality—taking advantage of their capabilities, but at the cost of high computational overhead and reduced data diversity. To address these limitations, in this paper, we propose MergeIT, a novel LLM-based Merging strategy for better Instruction Tuning that shifts the focus from selection to synthesis. MergeIT consists of two stages: first, topic-aware filtering clusters and refines the dataset, preserving diversity while eliminating redundancy without relying on LLM-based scoring, significantly reducing time and computational cost. Second, LLM-based merging synthesizes semantically similar instructions into more informative and compact training data, enhancing data richness while further reducing the size of the dataset. Experimental results demonstrate that MergeIT enables efficient, diverse, and scalable instruction selection and synthesis, establishing LLM-based merging as a promising alternative to prior scoring-based selection methods for instruction tuning.

Anthology ID: 2026.findings-acl.46
Volume: Findings of the Association for Computational Linguistics: ACL 2026
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 912–923
URL: https://aclanthology.org/2026.findings-acl.46/
Cite (ACL): Hongyi Cai, Yuqian Fu, Hongming Fu, and Bo Zhao. 2026. MergeIT: From Selection to Merging for Efficient Instruction Tuning. In Findings of the Association for Computational Linguistics: ACL 2026, pages 912–923, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): MergeIT: From Selection to Merging for Efficient Instruction Tuning (Cai et al., Findings 2026)
PDF: https://aclanthology.org/2026.findings-acl.46.pdf
Checklist: 2026.findings-acl.46.checklist.pdf
