Let the CAT out of the bag: Contrastive Attributed explanations for Text

Saneem Chemmengath, Amar Prakash Azad, Ronny Luss, Amit Dhurandhar


Abstract
Contrastive explanations for understanding the behavior of black-box models have gained a lot of attention recently, as they provide potential for recourse. In this paper, we propose a method, Contrastive Attributed explanations for Text (CAT), which provides contrastive explanations for natural language text data with a novel twist: we build and exploit attribute classifiers, leading to more semantically meaningful explanations. To ensure that our generated contrastive text has the fewest possible edits with respect to the original text, while also being fluent and close to a human-generated contrastive, we resort to a minimal perturbation approach regularized using a BERT language model and attribute classifiers trained on available attributes. We show through qualitative examples and a user study that our method not only conveys more insight because of these attributes, but also leads to better quality (contrastive) text. Quantitatively, we show that our method outperforms other state-of-the-art methods across four data sets on four benchmark metrics.
Anthology ID:
2022.emnlp-main.484
Volume:
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
Month:
December
Year:
2022
Address:
Abu Dhabi, United Arab Emirates
Editors:
Yoav Goldberg, Zornitsa Kozareva, Yue Zhang
Venue:
EMNLP
Publisher:
Association for Computational Linguistics
Pages:
7190–7206
URL:
https://aclanthology.org/2022.emnlp-main.484
DOI:
10.18653/v1/2022.emnlp-main.484
Cite (ACL):
Saneem Chemmengath, Amar Prakash Azad, Ronny Luss, and Amit Dhurandhar. 2022. Let the CAT out of the bag: Contrastive Attributed explanations for Text. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 7190–7206, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
Cite (Informal):
Let the CAT out of the bag: Contrastive Attributed explanations for Text (Chemmengath et al., EMNLP 2022)
PDF:
https://aclanthology.org/2022.emnlp-main.484.pdf