A Survey on Foundation Language Models for Single-cell Biology

Fan Zhang; Hao Chen; Zhihong Zhu; Ziheng Zhang; Zhenxi Lin; Ziyue Qiao; Yefeng Zheng; Xian Wu

doi:10.18653/v1/2025.acl-long.26

A Survey on Foundation Language Models for Single-cell Biology

Fan Zhang, Hao Chen, Zhihong Zhu, Ziheng Zhang, Zhenxi Lin, Ziyue Qiao, Yefeng Zheng, Xian Wu

Abstract

The recent advancements in language models have significantly catalyzed progress in computational biology. A growing body of research strives to construct unified foundation models for single-cell biology, with language models serving as the cornerstone. In this paper, we systematically review the developments in foundation language models designed specifically for single-cell biology. Our survey offers a thorough analysis of various incarnations of single-cell foundation language models, viewed through the lens of both pre-trained language models (PLMs) and large language models (LLMs). This includes an exploration of data tokenization strategies, pre-training/tuning paradigms, and downstream single-cell data analysis tasks. Additionally, we discuss the current challenges faced by these pioneering works and speculate on future research directions. Overall, this survey provides a comprehensive overview of the existing single-cell foundation language models, paving the way for future research endeavors.

Anthology ID:: 2025.acl-long.26
Volume:: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: ACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 528–549
Language:
URL:: https://aclanthology.org/2025.acl-long.26/
DOI:: 10.18653/v1/2025.acl-long.26
Bibkey:
Cite (ACL):: Fan Zhang, Hao Chen, Zhihong Zhu, Ziheng Zhang, Zhenxi Lin, Ziyue Qiao, Yefeng Zheng, and Xian Wu. 2025. A Survey on Foundation Language Models for Single-cell Biology. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 528–549, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: A Survey on Foundation Language Models for Single-cell Biology (Zhang et al., ACL 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.acl-long.26.pdf

PDF Cite Search Fix data