Title: S3Eval: A Synthetic, Scalable, Systematic Evaluation Suite for Large Language Model
Authors: Fangyu Lei, Qian Liu, Yiming Huang, Shizhu He, Jun Zhao, Kang Liu
Date: 2024-06
Published in: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Editors: Kevin Duh, Helena Gomez, Steven Bethard
Publisher: Association for Computational Linguistics
Place: Mexico City, Mexico
Type: conference publication
Pages: 1259-1286
Citation key: lei-etal-2024-s3eval
DOI: 10.18653/v1/2024.naacl-long.69
URL: https://aclanthology.org/2024.naacl-long.69/