WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models Prannaya Gupta author Le Qi Yau author Hao Han Low author I-Shiang Lee author Hugo Maximus Lim author Yu Xin Teoh author Koh Jia Hng author Dar Win Liew author Rishabh Bhardwaj author Rajat Bhardwaj author Soujanya Poria author 2024-11 text Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: System Demonstrations Delia Irazu Hernandez Farias editor Tom Hope editor Manling Li editor Association for Computational Linguistics Miami, Florida, USA conference publication gupta-etal-2024-walledeval 10.18653/v1/2024.emnlp-demo.42 https://aclanthology.org/2024.emnlp-demo.42/ 2024-11 397 407