Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models

Ji Liu; Jiaxiang Ren; Ruoming Jin; Zijie Zhang; Yang Zhou; Patrick Valduriez; Dejing Dou

doi:10.18653/v1/2024.emnlp-main.587

Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models

Ji Liu, Jiaxiang Ren, Ruoming Jin, Zijie Zhang, Yang Zhou, Patrick Valduriez, Dejing Dou

Abstract

As a promising paradigm to collaboratively train models with decentralized data, Federated Learning (FL) can be exploited to fine-tune Large Language Models (LLMs). While LLMs correspond to huge size, the scale of the training data significantly increases, which leads to tremendous amounts of computation and communication costs. The training data is generally non-Independent and Identically Distributed (non-IID), which requires adaptive data processing within each device. Although Low-Rank Adaptation (LoRA) can significantly reduce the scale of parameters to update in the fine-tuning process, it still takes unaffordable time to transfer the low-rank parameters of all the layers in LLMs. In this paper, we propose a Fisher Information-based Efficient Curriculum Federated Learning framework (FibecFed) with two novel methods, i.e., adaptive federated curriculum learning and efficient sparse parameter update. First, we propose a fisher information-based method to adaptively sample data within each device to improve the effectiveness of the FL fine-tuning process. Second, we dynamically select the proper layers for global aggregation and sparse parameters for local update with LoRA so as to improve the efficiency of the FL fine-tuning process. Extensive experimental results based on 10 datasets demonstrate that FibecFed yields excellent performance (up to 45.35% in terms of accuracy) and superb fine-tuning speed (up to 98.61% faster) compared with 17 baseline approaches).

Anthology ID:: 2024.emnlp-main.587
Volume:: Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2024
Address:: Miami, Florida, USA
Editors:: Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 10497–10523
Language:
URL:: https://aclanthology.org/2024.emnlp-main.587/
DOI:: 10.18653/v1/2024.emnlp-main.587
Bibkey:
Cite (ACL):: Ji Liu, Jiaxiang Ren, Ruoming Jin, Zijie Zhang, Yang Zhou, Patrick Valduriez, and Dejing Dou. 2024. Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, pages 10497–10523, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):: Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models (Liu et al., EMNLP 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.emnlp-main.587.pdf
Software:: 2024.emnlp-main.587.software.zip

PDF Cite Search Software Fix data