PRP: Propagating Universal Perturbations to Attack Large Language Model Guard-Rails Neal Mangaokar author Ashish Hooda author Jihye Choi author Shreyas Chandrashekaran author Kassem Fawaz author Somesh Jha author Atul Prakash author 2024-08 text Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) Lun-Wei Ku editor Andre Martins editor Vivek Srikumar editor Association for Computational Linguistics Bangkok, Thailand conference publication mangaokar-etal-2024-prp 10.18653/v1/2024.acl-long.591 https://aclanthology.org/2024.acl-long.591/ 2024-08 10960 10976