MAIN: Mutual Alignment Is Necessary for instruction tuning

Fanyi Yang; Jianfeng Liu; Xin Zhang; Haoyu Liu; Xixin Cao; Yuefeng Zhan; Hao Sun; Weiwei Deng; Feng Sun; Qi Zhang

doi:10.18653/v1/2025.emnlp-main.644

Abstract

Instruction tuning has empowered large language models (LLMs) to achieve remarkable performance, yet its success heavily depends on the availability of large-scale, high-quality instruction-response pairs. To meet this demand, various methods have been developed to synthesize data at scale. However, current methods for scaling up data generation often overlook a crucial aspect: the alignment between instructions and responses. We hypothesize that the quality of instruction-response pairs is determined not by the individual quality of each component, but by the degree of mutual alignment. To address this, we propose a Mutual Alignment Framework (MAIN) which enforces coherence between instructions and responses through mutual constraints. We demonstrate that MAIN generalizes well across model architectures and sizes, achieving state-of-the-art performance on LLaMA, Mistral, and Qwen models across diverse benchmarks. This work underscores the critical role of instruction-response alignment in enabling generalizable and high-quality instruction tuning for LLMs. All code is available from our repository.

Anthology ID: 2025.emnlp-main.644
Volume: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month: November
Year: 2025
Address: Suzhou, China
Editors: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue: EMNLP
Publisher: Association for Computational Linguistics
Pages: 12757–12769
URL: https://aclanthology.org/2025.emnlp-main.644/
DOI: 10.18653/v1/2025.emnlp-main.644
Cite (ACL): Fanyi Yang, Jianfeng Liu, Xin Zhang, Haoyu Liu, Xixin Cao, Yuefeng Zhan, Hao Sun, Weiwei Deng, Feng Sun, and Qi Zhang. 2025. MAIN: Mutual Alignment Is Necessary for instruction tuning. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 12757–12769, Suzhou, China. Association for Computational Linguistics.
Cite (Informal): MAIN: Mutual Alignment Is Necessary for instruction tuning (Yang et al., EMNLP 2025)
PDF: https://aclanthology.org/2025.emnlp-main.644.pdf
Checklist: 2025.emnlp-main.644.checklist.pdf
