LLMInit: A Free Lunch from Large Language Models for Selective Initialization of Recommendation

Weizhi Zhang; Liangwei Yang; Wooseong Yang; Henry Peng Zou; Yuqing Liu; Ke Xu; Sourav Medya; Philip S. Yu

doi:10.18653/v1/2025.emnlp-industry.141

Abstract

Collaborative filtering (CF) is widely adopted in industrial recommender systems (RecSys) for modeling user-item interactions across numerous applications, but it often struggles in cold-start and data-sparse scenarios. Recent advances in pre-trained large language models (LLMs), which carry rich semantic knowledge, offer promising solutions to these challenges. However, deploying LLMs at scale is hindered by their significant computational demands and latency. In this paper, we propose a novel and scalable LLM-RecSys framework, LLMInit, designed to integrate pre-trained LLM embeddings into CF models through selective initialization strategies. Specifically, we identify an embedding collapse issue that arises when CF models are scaled up to match the large embedding sizes of LLMs, and we avoid it by introducing efficient sampling methods, including random, uniform, and variance-based selection. Comprehensive experiments conducted on multiple real-world datasets demonstrate that LLMInit significantly improves recommendation performance while maintaining low computational costs, offering a practical and scalable solution for industrial applications. To facilitate industry adoption and promote future research, we provide open-source access to our implementation at https://github.com/DavidZWZ/LLMInit.
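
The abstract names three selection strategies (random, uniform, and variance-based) but does not spell out how each one picks embedding dimensions. The following is a minimal sketch of one plausible reading, in which a small CF embedding table is initialized from a subset of the columns of a larger LLM embedding matrix. The function name `select_dims`, its parameters, and the exact selection rules are assumptions made for illustration; they are not taken from the LLMInit code base.

```python
import numpy as np


def select_dims(llm_emb, k, strategy="variance", seed=0):
    """Pick k of the D columns of an (n_items, D) LLM embedding matrix.

    Hypothetical helper: the three strategies are illustrative guesses at
    what "random", "uniform", and "variance-based" selection could mean.
    Returns the reduced (n_items, k) matrix and the chosen column indices.
    """
    n_items, d = llm_emb.shape
    if not 0 < k <= d:
        raise ValueError("k must be between 1 and the LLM embedding size")

    if strategy == "random":
        # Sample k distinct column indices at random.
        rng = np.random.default_rng(seed)
        idx = np.sort(rng.choice(d, size=k, replace=False))
    elif strategy == "uniform":
        # Take evenly spaced columns; integer arithmetic keeps them distinct.
        idx = (np.arange(k) * d) // k
    elif strategy == "variance":
        # Keep the k columns whose values vary most across items.
        col_var = llm_emb.var(axis=0)
        idx = np.sort(np.argsort(col_var)[-k:])
    else:
        raise ValueError(f"unknown strategy: {strategy}")

    return llm_emb[:, idx], idx


# Toy input standing in for LLM item embeddings: 4 items, 6 dimensions.
# Column j has variance scales[j] ** 2 by construction.
scales = np.array([0.1, 5.0, 0.2, 3.0, 0.3, 1.0])
toy_emb = np.outer(np.array([-1.0, 1.0, -1.0, 1.0]), scales)

init_var, idx_var = select_dims(toy_emb, k=2, strategy="variance")
init_uni, idx_uni = select_dims(toy_emb, k=3, strategy="uniform")
init_rnd, idx_rnd = select_dims(toy_emb, k=3, strategy="random")
```

Under this reading, the reduced matrix would be copied into the CF model's item embedding table before training, so the CF model keeps its usual small embedding size and the LLM is not needed at serving time.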

Anthology ID: 2025.emnlp-industry.141
Volume: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track
Month: November
Year: 2025
Address: Suzhou (China)
Editors: Saloni Potdar, Lina Rojas-Barahona, Sebastien Montella
Venue: EMNLP
Publisher: Association for Computational Linguistics
Pages: 2016–2024
URL: https://aclanthology.org/2025.emnlp-industry.141/
DOI: 10.18653/v1/2025.emnlp-industry.141
Cite (ACL): Weizhi Zhang, Liangwei Yang, Wooseong Yang, Henry Peng Zou, Yuqing Liu, Ke Xu, Sourav Medya, and Philip S. Yu. 2025. LLMInit: A Free Lunch from Large Language Models for Selective Initialization of Recommendation. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: Industry Track, pages 2016–2024, Suzhou (China). Association for Computational Linguistics.
Cite (Informal): LLMInit: A Free Lunch from Large Language Models for Selective Initialization of Recommendation (Zhang et al., EMNLP 2025)
PDF: https://aclanthology.org/2025.emnlp-industry.141.pdf
