Title: LIRE: listwise reward enhancement for preference alignment
Authors: Mingye Zhu, Yi Liu, Lei Zhang, Junbo Guo, Zhendong Mao
Published in: Findings of the Association for Computational Linguistics: ACL 2024
Editors: Lun-Wei Ku, Andre Martins, Vivek Srikumar
Publisher: Association for Computational Linguistics
Location: Bangkok, Thailand
Date: 2024-08
Type: conference publication
Pages: 3377-3394
Citation key: zhu-etal-2024-lire
DOI: 10.18653/v1/2024.findings-acl.201
URL: https://aclanthology.org/2024.findings-acl.201/