SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF Yi Dong author Zhilin Wang author Makesh Sreedhar author Xianchao Wu author Oleksii Kuchaiev author 2023-12 text Findings of the Association for Computational Linguistics: EMNLP 2023 Houda Bouamor editor Juan Pino editor Kalika Bali editor Association for Computational Linguistics Singapore conference publication dong-etal-2023-steerlm 10.18653/v1/2023.findings-emnlp.754 https://aclanthology.org/2023.findings-emnlp.754/ 2023-12 11275 11288