CoPL: Collaborative Preference Learning for Personalizing LLMs

Youngbin Choi; Seunghyuk Cho; Minjong Lee; MoonJeong Park; Yesong Ko; Jungseul Ok; Dongwoo Kim

doi:10.18653/v1/2025.emnlp-main.650

CoPL: Collaborative Preference Learning for Personalizing LLMs

Youngbin Choi, Seunghyuk Cho, Minjong Lee, MoonJeong Park, Yesong Ko, Jungseul Ok, Dongwoo Kim

Abstract

Personalizing large language models (LLMs) is important for aligning outputs with diverse user preferences, yet existing methods struggle with flexibility and generalization. We propose CoPL (Collaborative Preference Learning), a graph-based collaborative filtering framework that models user-response relationships to enhance preference estimation, particularly in sparse annotation settings. By integrating a mixture of LoRA experts, CoPL efficiently fine-tunes LLMs while dynamically balancing shared and user-specific preferences. Additionally, an optimization-free adaptation strategy enables generalization to unseen users without fine-tuning. Experiments on TL;DR, UltraFeedback-P, and PersonalLLM datasets demonstrate that CoPL outperforms existing personalized reward models, effectively capturing both common and controversial preferences, making it a scalable solution for personalized LLM alignment.

Anthology ID:: 2025.emnlp-main.650
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 12875–12893
Language:
URL:: https://aclanthology.org/2025.emnlp-main.650/
DOI:: 10.18653/v1/2025.emnlp-main.650
Bibkey:
Cite (ACL):: Youngbin Choi, Seunghyuk Cho, Minjong Lee, MoonJeong Park, Yesong Ko, Jungseul Ok, and Dongwoo Kim. 2025. CoPL: Collaborative Preference Learning for Personalizing LLMs. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 12875–12893, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: CoPL: Collaborative Preference Learning for Personalizing LLMs (Choi et al., EMNLP 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.emnlp-main.650.pdf
Checklist:: 2025.emnlp-main.650.checklist.pdf

PDF Cite Search Checklist Fix data