Title: CLEAN–EVAL: Clean Evaluation on Contaminated Large Language Models
Authors: Wenhong Zhu, Hongkun Hao, Zhiwei He, Yun-Ze Song, Jiao Yueyang, Yumeng Zhang, Hanxu Hu, Yiran Wei, Rui Wang, Hongyuan Lu
Published in: Findings of the Association for Computational Linguistics: NAACL 2024
Editors: Kevin Duh, Helena Gomez, Steven Bethard
Publisher: Association for Computational Linguistics
Location: Mexico City, Mexico
Date: 2024-06
Type: text (conference publication)
Pages: 835-847
Citation key: zhu-etal-2024-clean
DOI: 10.18653/v1/2024.findings-naacl.53
URL: https://aclanthology.org/2024.findings-naacl.53/