Interventional Probing in High Dimensions: An NLI Case Study

Julia Rozanova; Marco Valentino; Lucas Cordeiro; André Freitas

doi:10.18653/v1/2023.findings-eacl.188

Abstract

Probing strategies have been shown to detect the presence of various linguistic features in large language models; in particular, semantic features intermediate to the “natural logic” fragment of the Natural Language Inference task (NLI). In the case of natural logic, the relation between the intermediate features and the entailment label is explicitly known: as such, this provides a ripe setting for interventional studies on the NLI models’ representations, allowing for stronger causal conjectures and a deeper critical analysis of interventional probing methods. In this work, we carry out new and existing representation-level interventions to investigate the effect of these semantic features on NLI classification: we perform amnesic probing (which removes features as directed by learned linear probes) and introduce the mnestic probing variation (which forgets all dimensions except the probe-selected ones). Furthermore, we delve into the limitations of these methods and outline some pitfalls that have been obscuring the effectiveness of interventional probing studies.
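
The contrast between the two interventions named in the abstract can be illustrated with a minimal NumPy sketch. It assumes a single linear probe with weight matrix `W` and a one-step orthogonal projection; the paper's actual procedure (for example, an iterative nullspace projection) may differ. The function names, the random data, and the random `W` standing in for learned probe weights are all hypothetical.

```python
import numpy as np

def rowspace_projector(W, tol=1e-10):
    """Orthogonal projector onto the row space of W (shape k x d)."""
    # The rows of Vt with non-zero singular values form an orthonormal
    # basis of the subspace the probe reads from.
    _, s, Vt = np.linalg.svd(W, full_matrices=False)
    basis = Vt[s > tol]            # (rank, d)
    return basis.T @ basis         # (d, d), symmetric

def amnesic(X, W):
    """Remove the probe-selected directions from each row of X."""
    d = W.shape[1]
    return X @ (np.eye(d) - rowspace_projector(W))

def mnestic(X, W):
    """Keep only the probe-selected directions of each row of X."""
    return X @ rowspace_projector(W)

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 8))   # 5 toy representations of dimension 8
W = rng.normal(size=(2, 8))   # assumed stand-in for learned probe weights

X_amn = amnesic(X, W)         # the probe sees nothing here: X_amn @ W.T is ~0
X_mne = mnestic(X, W)         # the probe output is unchanged: X_mne @ W.T == X @ W.T
```

In this sketch the two outputs are complementary: they sum back to `X`, so the amnesic version discards exactly what the mnestic version retains.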

Anthology ID: 2023.findings-eacl.188
Volume: Findings of the Association for Computational Linguistics: EACL 2023
Month: May
Year: 2023
Address: Dubrovnik, Croatia
Editors: Andreas Vlachos, Isabelle Augenstein
Venue: Findings
Publisher: Association for Computational Linguistics
Pages: 2489–2500
URL: https://aclanthology.org/2023.findings-eacl.188/
DOI: 10.18653/v1/2023.findings-eacl.188
Cite (ACL): Julia Rozanova, Marco Valentino, Lucas Cordeiro, and André Freitas. 2023. Interventional Probing in High Dimensions: An NLI Case Study. In Findings of the Association for Computational Linguistics: EACL 2023, pages 2489–2500, Dubrovnik, Croatia. Association for Computational Linguistics.
Cite (Informal): Interventional Probing in High Dimensions: An NLI Case Study (Rozanova et al., Findings 2023)
PDF: https://aclanthology.org/2023.findings-eacl.188.pdf
