%0 Conference Proceedings %T Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence %A Lu, Junru %A Li, Jiazheng %A An, Siyu %A Zhao, Meng %A He, Yulan %A Yin, Di %A Sun, Xing %Y Al-Onaizan, Yaser %Y Bansal, Mohit %Y Chen, Yun-Nung %S Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing %D 2024 %8 November %I Association for Computational Linguistics %C Miami, Florida, USA %F lu-etal-2024-eliminating %R 10.18653/v1/2024.emnlp-main.60 %U https://aclanthology.org/2024.emnlp-main.60/ %U https://doi.org/10.18653/v1/2024.emnlp-main.60 %P 1047-1067