%0 Conference Proceedings %T Don‘t Forget Your Reward Values: Language Model Alignment via Value-based Calibration %A Mao, Xin %A Li, Feng-Lin %A Xu, Huimin %A Zhang, Wei %A Chen, Wang %A Luu, Anh Tuan %Y Al-Onaizan, Yaser %Y Bansal, Mohit %Y Chen, Yun-Nung %S Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing %D 2024 %8 November %I Association for Computational Linguistics %C Miami, Florida, USA %F mao-etal-2024-dont %R 10.18653/v1/2024.emnlp-main.976 %U https://aclanthology.org/2024.emnlp-main.976/ %U https://doi.org/10.18653/v1/2024.emnlp-main.976 %P 17622-17642