NaturalCodeBench: Examining Coding Performance Mismatch on HumanEval and Natural User Queries Shudan Zhang author Hanlin Zhao author Xiao Liu author Qinkai Zheng author Zehan Qi author Xiaotao Gu author Yuxiao Dong author Jie Tang author 2024-08 text Findings of the Association for Computational Linguistics: ACL 2024 Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication zhang-etal-2024-naturalcodebench 10.18653/v1/2024.findings-acl.471 https://aclanthology.org/2024.findings-acl.471/ 2024-08 7907 7928