Yinpeng Dong
2025
AutoBreach: Universal and Adaptive Jailbreaking with Efficient Wordplay-Guided Optimization via Multi-LLMs
Jiawei Chen
|
Xiao Yang
|
Zhengwei Fang
|
Yu Tian
|
Yinpeng Dong
|
Zhaoxia Yin
|
Hang Su
Findings of the Association for Computational Linguistics: NAACL 2025
Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
Haonan Li
|
Xudong Han
|
Zenan Zhai
|
Honglin Mu
|
Hao Wang
|
Zhenxuan Zhang
|
Yilin Geng
|
Shom Lin
|
Renxi Wang
|
Artem Shelmanov
|
Xiangyu Qi
|
Yuxia Wang
|
Donghai Hong
|
Youliang Yuan
|
Meng Chen
|
Haoqin Tu
|
Fajri Koto
|
Cong Zeng
|
Tatsuki Kuribayashi
|
Rishabh Bhardwaj
|
Bingchen Zhao
|
Yawen Duan
|
Yi Liu
|
Emad A. Alghamdi
|
Yaodong Yang
|
Yinpeng Dong
|
Soujanya Poria
|
Pengfei Liu
|
Zhengzhong Liu
|
Hector Xuguang Ren
|
Eduard Hovy
|
Iryna Gurevych
|
Preslav Nakov
|
Monojit Choudhury
|
Timothy Baldwin
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (System Demonstrations)
Co-authors
- Emad A. Alghamdi 1
- Timothy Baldwin 1
- Rishabh Bhardwaj 1
- Jiawei Chen (陈佳炜) 1
- Meng Chen 1
- show all...
- Monojit Choudhury 1
- Yawen Duan 1
- Zhengwei Fang 1
- Yilin Geng 1
- Iryna Gurevych 1
- Xudong Han 1
- Donghai Hong 1
- Eduard Hovy 1
- Fajri Koto 1
- Tatsuki Kuribayashi 1
- Haonan Li 1
- Shom Lin 1
- Yi Liu 1
- Pengfei Liu 1
- Zhengzhong Liu 1
- Honglin Mu 1
- Preslav Nakov 1
- Soujanya Poria 1
- Xiangyu Qi 1
- Hector Xuguang Ren 1
- Artem Shelmanov 1
- Hang Su 1
- Yu Tian 1
- Haoqin Tu 1
- Hao Wang (汪浩) 1
- Renxi Wang 1
- Yuxia Wang 1
- Xiao Yang (杨潇) 1
- Yaodong Yang 1
- Zhaoxia Yin 1
- Youliang Yuan 1
- Cong Zeng 1
- Zenan Zhai 1
- Zhenxuan Zhang 1
- Bingchen Zhao 1