Title: Learning Preference Model for LLMs via Automatic Preference Data Generation
Authors: Shijia Huang, Jianqiao Zhao, Yanyang Li, Liwei Wang
Date: 2023-12
Type: conference publication
In: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
Editors: Houda Bouamor, Juan Pino, Kalika Bali
Publisher: Association for Computational Linguistics
Location: Singapore
Pages: 9187-9199
Citation key: huang-etal-2023-learning-preference
DOI: 10.18653/v1/2023.emnlp-main.570
URL: https://aclanthology.org/2023.emnlp-main.570/