NeuroX Library for Neuron Analysis of Deep NLP Models

Fahim Dalvi, Hassan Sajjad, Nadir Durrani


Abstract
Neuron analysis provides insights into how knowledge is structured in representations and discovers the role of neurons in the network. In addition to developing an understanding of our models, neuron analysis enables various applications such as debiasing, domain adaptation and architectural search. We present NeuroX, a comprehensive open-source toolkit to conduct neuron analysis of natural language processing models. It implements various interpretation methods under a unified API, and provides a framework for data processing and evaluation, thus making it easier for researchers and practitioners to perform neuron analysis. The Python toolkit is available at https://www.github.com/fdalvi/NeuroX. A demo video is available at https://youtu.be/mLhs2YMx4u8
Anthology ID:
2023.acl-demo.21
Volume:
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations)
Month:
July
Year:
2023
Address:
Toronto, Canada
Editors:
Danushka Bollegala, Ruihong Huang, Alan Ritter
Venue:
ACL
Publisher:
Association for Computational Linguistics
Pages:
226–234
URL:
https://aclanthology.org/2023.acl-demo.21
DOI:
10.18653/v1/2023.acl-demo.21
Cite (ACL):
Fahim Dalvi, Hassan Sajjad, and Nadir Durrani. 2023. NeuroX Library for Neuron Analysis of Deep NLP Models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 226–234, Toronto, Canada. Association for Computational Linguistics.
Cite (Informal):
NeuroX Library for Neuron Analysis of Deep NLP Models (Dalvi et al., ACL 2023)
PDF:
https://aclanthology.org/2023.acl-demo.21.pdf
Video:
https://aclanthology.org/2023.acl-demo.21.mp4