A Strong Baseline for Query Efficient Attacks in a Black Box Setting

Rishabh Maheshwary, Saket Maheshwary, Vikram Pudi


Abstract
Existing black box search methods have achieved high success rate in generating adversarial attacks against NLP models. However, such search methods are inefficient as they do not consider the amount of queries required to generate adversarial attacks. Also, prior attacks do not maintain a consistent search space while comparing different search methods. In this paper, we propose a query efficient attack strategy to generate plausible adversarial examples on text classification and entailment tasks. Our attack jointly leverages attention mechanism and locality sensitive hashing (LSH) to reduce the query count. We demonstrate the efficacy of our approach by comparing our attack with four baselines across three different search spaces. Further, we benchmark our results across the same search space used in prior attacks. In comparison to attacks proposed, on an average, we are able to reduce the query count by 75% across all datasets and target models. We also demonstrate that our attack achieves a higher success rate when compared to prior attacks in a limited query setting.
Anthology ID:
2021.emnlp-main.661
Volume:
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2021
Address:
Online and Punta Cana, Dominican Republic
Editors:
Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
8396–8409
Language:
URL:
https://aclanthology.org/2021.emnlp-main.661
DOI:
10.18653/v1/2021.emnlp-main.661
Bibkey:
Cite (ACL):
Rishabh Maheshwary, Saket Maheshwary, and Vikram Pudi. 2021. A Strong Baseline for Query Efficient Attacks in a Black Box Setting. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, pages 8396–8409, Online and Punta Cana, Dominican Republic. Association for Computational Linguistics.
Cite (Informal):
A Strong Baseline for Query Efficient Attacks in a Black Box Setting (Maheshwary et al., EMNLP 2021)
Copy Citation:
PDF:
https://aclanthology.org/2021.emnlp-main.661.pdf
Video:
 https://aclanthology.org/2021.emnlp-main.661.mp4
Code
 rishabhmaheshwary/query-attack
Data
IMDb Movie ReviewsMultiNLI