ReFT: Reasoning with Reinforced Fine-Tuning Luong Trung author Xinbo Zhang author Zhanming Jie author Peng Sun author Xiaoran Jin author Hang Li author 2024-08 text Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication trung-etal-2024-reft 10.18653/v1/2024.acl-long.410 https://aclanthology.org/2024.acl-long.410/ 2024-08 7601 7614