A Chinese Dataset for Evaluating the Safeguards in Large Language Models Yuxia Wang author Zenan Zhai author Haonan Li author Xudong Han author Shom Lin author Zhenxuan Zhang author Angela Zhao author Preslav Nakov author Timothy Baldwin author 2024-08 text Findings of the Association for Computational Linguistics: ACL 2024 Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication wang-etal-2024-chinese 10.18653/v1/2024.findings-acl.184 https://aclanthology.org/2024.findings-acl.184/ 2024-08 3106 3119