@inproceedings{wen-etal-2025-cheems,
title = "Cheems: A Practical Guidance for Building and Evaluating {C}hinese Reward Models from Scratch",
author = "Wen, Xueru and
Lou, Jie and
Li, Zichao and
Lu, Yaojie and
XingYu, XingYu and
Ji, Yuqiu and
Xu, Guohai and
Lin, Hongyu and
He, Ben and
Han, Xianpei and
Sun, Le and
Zhang, Debing",
editor = "Che, Wanxiang and
Nabende, Joyce and
Shutova, Ekaterina and
Pilehvar, Mohammad Taher",
booktitle = "Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
month = jul,
year = "2025",
address = "Vienna, Austria",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.acl-long.737/",
doi = "10.18653/v1/2025.acl-long.737",
pages = "15187--15211",
ISBN = "979-8-89176-251-0",
abstract = "Reward models (RMs) are crucial for aligning large language models (LLMs) with human preferences. However, most RM research is centered on English and relies heavily on synthetic resources, which leads to limited and less reliable datasets and benchmarks for Chinese. To address this gap, we introduce CheemsBench, a fully human-annotated RM evaluation benchmark within Chinese contexts, and CheemsPreference, a large-scale and diverse preference dataset annotated through human-machine collaboration to support Chinese RM training. We systematically evaluate open-source discriminative and generative RMs on CheemsBench and observe significant limitations in their ability to capture human preferences in Chinese scenarios. Additionally, based on CheemsPreference, we construct an RM that achieves state-of-the-art performance on CheemsBench, demonstrating the necessity of human supervision in RM training. Our findings reveal that scaled AI-generated data struggles to fully capture human preferences, emphasizing the importance of high-quality human supervision in RM development."
}
Xueru Wen, Jie Lou, Zichao Li, Yaojie Lu, XingYu XingYu, Yuqiu Ji, Guohai Xu, Hongyu Lin, Ben He, Xianpei Han, Le Sun, and Debing Zhang. 2025. Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 15187–15211, Vienna, Austria. Association for Computational Linguistics.