Red Teaming Language Models with Language Models Ethan Perez author Saffron Huang author Francis Song author Trevor Cai author Roman Ring author John Aslanides author Amelia Glaese author Nat McAleese author Geoffrey Irving author 2022-12 text Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing Yoav Goldberg editor Zornitsa Kozareva editor Yue Zhang editor Association for Computational Linguistics Abu Dhabi, United Arab Emirates conference publication perez-etal-2022-red 10.18653/v1/2022.emnlp-main.225 https://aclanthology.org/2022.emnlp-main.225/ 2022-12 3419 3448