Title: Unraveling the Mystery: Defending Against Jailbreak Attacks Via Unearthing Real Intention
Authors: Yanhao Li, Hongshen Chen, Heng Zhang, Zhiwei Ge, Tianhao Li, Sulong Xu, Guibo Luo
Date: 2025-01
Resource type: text
Genre: conference publication
In: Proceedings of the 31st International Conference on Computational Linguistics
Editors: Owen Rambow, Leo Wanner, Marianna Apidianaki, Hend Al-Khalifa, Barbara Di Eugenio, Steven Schockaert
Publisher: Association for Computational Linguistics
Place: Abu Dhabi, UAE
Pages: 8374-8384
Identifier: li-etal-2025-unraveling
URL: https://aclanthology.org/2025.coling-main.560/