Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards

Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards Haoxiang Wang author Yong Lin author Wei Xiong author Rui Yang author Shizhe Diao author Shuang Qiu author Han Zhao author Tong Zhang author 2024-08 text Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication wang-etal-2024-arithmetic 10.18653/v1/2024.acl-long.468 https://aclanthology.org/2024.acl-long.468/ 2024-08 8642 8655