Zhouxing Shi
2024
Red Teaming Language Model Detectors with Language Models
Zhouxing Shi
|
Yihan Wang
|
Fan Yin
|
Xiangning Chen
|
Kai-Wei Chang
|
Cho-Jui Hsieh
Transactions of the Association for Computational Linguistics, Volume 12
Defending LLMs against Jailbreaking Attacks via Backtranslation
Yihan Wang
|
Zhouxing Shi
|
Andrew Bai
|
Cho-Jui Hsieh
Findings of the Association for Computational Linguistics: ACL 2024
2022
On the Sensitivity and Stability of Model Interpretations in NLP
Fan Yin
|
Zhouxing Shi
|
Cho-Jui Hsieh
|
Kai-Wei Chang
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
2020
Robustness to Modification with Shared Words in Paraphrase Identification
Zhouxing Shi
|
Minlie Huang
Findings of the Association for Computational Linguistics: EMNLP 2020
Co-authors
- Cho-Jui Hsieh 3
- Yihan Wang 2
- Fan Yin 2
- Kai-Wei Chang 2
- Xiangning Chen 1
- show all...