Cross-modality Information Check for Detecting Jailbreaking in Multimodal Large Language Models

Yue Xu, Xiuyuan Qi, Zhan Qin, Wenjie Wang


Anthology ID:
2024.findings-emnlp.803
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2024
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
13715–13726
URL:
https://aclanthology.org/2024.findings-emnlp.803
Cite (ACL):
Yue Xu, Xiuyuan Qi, Zhan Qin, and Wenjie Wang. 2024. Cross-modality Information Check for Detecting Jailbreaking in Multimodal Large Language Models. In Findings of the Association for Computational Linguistics: EMNLP 2024, pages 13715–13726, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
Cross-modality Information Check for Detecting Jailbreaking in Multimodal Large Language Models (Xu et al., Findings 2024)
PDF:
https://aclanthology.org/2024.findings-emnlp.803.pdf