MART: Improving LLM Safety with Multi-round Automatic Red-Teaming Suyu Ge author Chunting Zhou author Rui Hou author Madian Khabsa author Yi-Chia Wang author Qifan Wang author Jiawei Han author Yuning Mao author 2024-06 text Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers) Kevin Duh editor Helena Gomez editor Steven Bethard editor Association for Computational Linguistics Mexico City, Mexico conference publication ge-etal-2024-mart 10.18653/v1/2024.naacl-long.107 https://aclanthology.org/2024.naacl-long.107/ 2024-06 1927 1937