LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models

Shizhe Diao, Rui Pan, Hanze Dong, KaShun Shum, Jipeng Zhang, Wei Xiong, Tong Zhang


Abstract
Foundation models have demonstrated a great ability to achieve general human-level intelligence far beyond traditional approaches. As the technique keeps attracting attention from the AI community, more and more foundation models have become publicly available. However, most of those models exhibit a major deficiency in specialized-domain and specialized-task applications, where the step of domain- and task-aware finetuning is still required to obtain scientific language models. As the number of available foundation models and specialized tasks keeps growing, the job of training scientific language models becomes highly nontrivial. In this paper, we take the first step to address this issue. We introduce an extensible and lightweight toolkit, LMFlow, which aims to simplify the domain- and task-aware finetuning of general foundation models. LMFlow offers a complete finetuning workflow for a foundation model to support specialized training with limited computing resources. Furthermore, it supports continuous pretraining, instruction tuning, parameter-efficient finetuning, alignment tuning, inference acceleration, long context generalization, model customization, and even multimodal finetuning, along with carefully designed and extensible APIs. This toolkit has been thoroughly tested and is available at https://github.com/OptimalScale/LMFlow.
Anthology ID:
2024.naacl-demo.12
Volume:
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 3: System Demonstrations)
Month:
June
Year:
2024
Address:
Mexico City, Mexico
Editors:
Kai-Wei Chang, Annie Lee, Nazneen Rajani
Venue:
NAACL
Publisher:
Association for Computational Linguistics
Pages:
116–127
URL:
https://aclanthology.org/2024.naacl-demo.12
Cite (ACL):
Shizhe Diao, Rui Pan, Hanze Dong, KaShun Shum, Jipeng Zhang, Wei Xiong, and Tong Zhang. 2024. LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 3: System Demonstrations), pages 116–127, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models (Diao et al., NAACL 2024)
PDF:
https://aclanthology.org/2024.naacl-demo.12.pdf