Federated Foundation Models: Privacy-Preserving and Collaborative Learning for Large Models

Sixing Yu; Juan Pablo Munoz; Ali Jannesari

Abstract

Foundation Models (FMs), such as LLaMA, BERT, GPT, ViT, and CLIP, have demonstrated remarkable success in a wide range of applications, driven by their ability to leverage vast amounts of data for pre-training. However, optimizing FMs often requires access to sensitive data, raising privacy concerns and limiting their applicability in many domains. In this paper, we propose the Federated Foundation Models (FFMs) paradigm, which combines the benefits of FMs and Federated Learning (FL) to enable privacy-preserving and collaborative learning across multiple end-users. We discuss the potential benefits and challenges of integrating FL into the lifespan of FMs, covering pre-training, fine-tuning, and application. We further outline potential future research avenues in FFM, including FFM pre-training, FFM fine-tuning, and federated prompt tuning, which allow the development of more personalized and context-aware models while ensuring data privacy. Moreover, we explore the possibility of continual/lifelong learning in FFMs, as increased computational power at the edge may unlock the potential for optimizing FMs using newly generated private data close to the data source. The proposed FFM concepts offer a flexible and scalable framework for training large language models in a privacy-preserving manner, setting the stage for subsequent advancements in both FM training and federated learning.

Anthology ID: 2024.lrec-main.630
Volume: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)
Month: May
Year: 2024
Address: Torino, Italia
Editors: Nicoletta Calzolari, Min-Yen Kan, Veronique Hoste, Alessandro Lenci, Sakriani Sakti, Nianwen Xue
Venues: LREC | COLING
Publisher: ELRA and ICCL
Pages: 7174–7184
URL: https://aclanthology.org/2024.lrec-main.630
Cite (ACL): Sixing Yu, Juan Pablo Munoz, and Ali Jannesari. 2024. Federated Foundation Models: Privacy-Preserving and Collaborative Learning for Large Models. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pages 7174–7184, Torino, Italia. ELRA and ICCL.
Cite (Informal): Federated Foundation Models: Privacy-Preserving and Collaborative Learning for Large Models (Yu et al., LREC-COLING 2024)
PDF: https://aclanthology.org/2024.lrec-main.630.pdf
