Rodrigo Revilla Llaca
2023
Student-Teacher Prompting for Red Teaming to Improve Guardrails
Rodrigo Revilla Llaca
|
Victoria Leskoschek
|
Vitor Costa Paiva
|
Cătălin Lupău
|
Philip Lippmann
|
Jie Yang
Proceedings of the ART of Safety: Workshop on Adversarial testing and Red-Teaming for generative AI