MeMoTune: A Measure and Moment-Driven Fine-Tuning Framework for Quantized Large Language Models

Yun Zhang; Xue Geng; Lizi Liao; Jintong Sun; Minghe Yu; Ge Yu (于戈)

doi:10.18653/v1/2025.findings-acl.208

MeMoTune: A Measure and Moment-Driven Fine-Tuning Framework for Quantized Large Language Models

Yun Zhang, Xue Geng, Lizi Liao, Jintong Sun, Minghe Yu, Ge Yu

Abstract

Quantizing large language models (LLMs) is essential for reducing memory and computational costs in natural language processing. Existing methods combine quantization with parameter-efficient fine-tuning but often fail to meet practical performance requirements. This paper introduces MeMoTune, a novel fine-tuning framework for quantized LLMs. By employing a measure and moment approach within a low-rank approximation framework in probability measure space, MeMoTune optimizes the objective function for superior fine-tuning results. The update process is further refined through scaled gradient, enhancing convergence efficiency and noise robustness. Experiments on tasks like text generation, summarization, and understanding show MeMoTune significantly outperforms state-of-the-art methods, e.g. fine-tuning Llama2-13B on GSM8K improves accuracy by 5.5%, while fine-tuning DeBERTaV3-base on CoLA of GLUE increases Matthews correlation by 1.7%. The code is publicly available at: https://github.com/hddyyyb/MeMoTune.

Anthology ID:: 2025.findings-acl.208
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 4036–4050
Language:
URL:: https://aclanthology.org/2025.findings-acl.208/
DOI:: 10.18653/v1/2025.findings-acl.208
Bibkey:
Cite (ACL):: Yun Zhang, Xue Geng, Lizi Liao, Jintong Sun, Minghe Yu, and Ge Yu. 2025. MeMoTune: A Measure and Moment-Driven Fine-Tuning Framework for Quantized Large Language Models. In Findings of the Association for Computational Linguistics: ACL 2025, pages 4036–4050, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: MeMoTune: A Measure and Moment-Driven Fine-Tuning Framework for Quantized Large Language Models (Zhang et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-acl.208.pdf

PDF Cite Search Fix data