@inproceedings{ravfogel-etal-2020-unsupervised,
title = "Unsupervised Distillation of Syntactic Information from Contextualized Word Representations",
author = "Ravfogel, Shauli and
Elazar, Yanai and
Goldberger, Jacob and
Goldberg, Yoav",
editor = "Alishahi, Afra and
Belinkov, Yonatan and
Chrupa{\l}a, Grzegorz and
Hupkes, Dieuwke and
Pinter, Yuval and
Sajjad, Hassan",
booktitle = "Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP",
month = nov,
year = "2020",
address = "Online",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2020.blackboxnlp-1.9",
doi = "10.18653/v1/2020.blackboxnlp-1.9",
pages = "91--106",
abstract = "Contextualized word representations, such as ELMo and BERT, were shown to perform well on various semantic and syntactic task. In this work, we tackle the task of unsupervised disentanglement between semantics and structure in neural language representations: we aim to learn a transformation of the contextualized vectors, that discards the lexical semantics, but keeps the structural information. To this end, we automatically generate groups of sentences which are structurally similar but semantically different, and use metric-learning approach to learn a transformation that emphasizes the structural component that is encoded in the vectors. We demonstrate that our transformation clusters vectors in space by structural properties, rather than by lexical semantics. Finally, we demonstrate the utility of our distilled representations by showing that they outperform the original contextualized representations in a few-shot parsing setting.",
}
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
<mods ID="ravfogel-etal-2020-unsupervised">
<titleInfo>
<title>Unsupervised Distillation of Syntactic Information from Contextualized Word Representations</title>
</titleInfo>
<name type="personal">
<namePart type="given">Shauli</namePart>
<namePart type="family">Ravfogel</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Yanai</namePart>
<namePart type="family">Elazar</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Jacob</namePart>
<namePart type="family">Goldberger</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Yoav</namePart>
<namePart type="family">Goldberg</namePart>
<role>
<roleTerm authority="marcrelator" type="text">author</roleTerm>
</role>
</name>
<originInfo>
<dateIssued>2020-11</dateIssued>
</originInfo>
<typeOfResource>text</typeOfResource>
<relatedItem type="host">
<titleInfo>
<title>Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP</title>
</titleInfo>
<name type="personal">
<namePart type="given">Afra</namePart>
<namePart type="family">Alishahi</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Yonatan</namePart>
<namePart type="family">Belinkov</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Grzegorz</namePart>
<namePart type="family">Chrupała</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Dieuwke</namePart>
<namePart type="family">Hupkes</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Yuval</namePart>
<namePart type="family">Pinter</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<name type="personal">
<namePart type="given">Hassan</namePart>
<namePart type="family">Sajjad</namePart>
<role>
<roleTerm authority="marcrelator" type="text">editor</roleTerm>
</role>
</name>
<originInfo>
<publisher>Association for Computational Linguistics</publisher>
<place>
<placeTerm type="text">Online</placeTerm>
</place>
</originInfo>
<genre authority="marcgt">conference publication</genre>
</relatedItem>
<abstract>Contextualized word representations, such as ELMo and BERT, were shown to perform well on various semantic and syntactic tasks. In this work, we tackle the task of unsupervised disentanglement between semantics and structure in neural language representations: we aim to learn a transformation of the contextualized vectors that discards the lexical semantics but keeps the structural information. To this end, we automatically generate groups of sentences which are structurally similar but semantically different, and use a metric-learning approach to learn a transformation that emphasizes the structural component that is encoded in the vectors. We demonstrate that our transformation clusters vectors in space by structural properties, rather than by lexical semantics. Finally, we demonstrate the utility of our distilled representations by showing that they outperform the original contextualized representations in a few-shot parsing setting.</abstract>
<identifier type="citekey">ravfogel-etal-2020-unsupervised</identifier>
<identifier type="doi">10.18653/v1/2020.blackboxnlp-1.9</identifier>
<location>
<url>https://aclanthology.org/2020.blackboxnlp-1.9</url>
</location>
<part>
<date>2020-11</date>
<extent unit="page">
<start>91</start>
<end>106</end>
</extent>
</part>
</mods>
</modsCollection>
%0 Conference Proceedings
%T Unsupervised Distillation of Syntactic Information from Contextualized Word Representations
%A Ravfogel, Shauli
%A Elazar, Yanai
%A Goldberger, Jacob
%A Goldberg, Yoav
%Y Alishahi, Afra
%Y Belinkov, Yonatan
%Y Chrupała, Grzegorz
%Y Hupkes, Dieuwke
%Y Pinter, Yuval
%Y Sajjad, Hassan
%S Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP
%D 2020
%8 November
%I Association for Computational Linguistics
%C Online
%F ravfogel-etal-2020-unsupervised
%X Contextualized word representations, such as ELMo and BERT, were shown to perform well on various semantic and syntactic tasks. In this work, we tackle the task of unsupervised disentanglement between semantics and structure in neural language representations: we aim to learn a transformation of the contextualized vectors that discards the lexical semantics but keeps the structural information. To this end, we automatically generate groups of sentences which are structurally similar but semantically different, and use a metric-learning approach to learn a transformation that emphasizes the structural component that is encoded in the vectors. We demonstrate that our transformation clusters vectors in space by structural properties, rather than by lexical semantics. Finally, we demonstrate the utility of our distilled representations by showing that they outperform the original contextualized representations in a few-shot parsing setting.
%R 10.18653/v1/2020.blackboxnlp-1.9
%U https://aclanthology.org/2020.blackboxnlp-1.9
%U https://doi.org/10.18653/v1/2020.blackboxnlp-1.9
%P 91-106
Markdown (Informal)
[Unsupervised Distillation of Syntactic Information from Contextualized Word Representations](https://aclanthology.org/2020.blackboxnlp-1.9) (Ravfogel et al., BlackboxNLP 2020)
ACL
Shauli Ravfogel, Yanai Elazar, Jacob Goldberger, and Yoav Goldberg. 2020. Unsupervised Distillation of Syntactic Information from Contextualized Word Representations. In Proceedings of the Third BlackboxNLP Workshop on Analyzing and Interpreting Neural Networks for NLP, pages 91–106, Online. Association for Computational Linguistics.
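
The abstract above describes learning a transformation of contextualized vectors with a metric-learning objective over structurally similar but lexically different sentences. A minimal, hypothetical sketch of that general idea (not the authors' code; the vector dimension, triplet construction, and training loop are assumptions) is a linear projection trained with a triplet loss:

```python
# Hypothetical sketch only: a linear map over contextualized vectors trained so
# that words in the same structural position of structurally-equivalent
# sentences are pulled together, and unrelated vectors pushed apart.
import torch
import torch.nn as nn

dim = 768  # assumed hidden size of the contextualized encoder (e.g. BERT-base)

transform = nn.Linear(dim, dim, bias=False)      # learned "structural" projection
loss_fn = nn.TripletMarginLoss(margin=1.0)
optimizer = torch.optim.Adam(transform.parameters(), lr=1e-3)

def training_step(anchor, positive, negative):
    """anchor/positive: (batch, dim) vectors from structurally similar sentences;
    negative: (batch, dim) vectors from unrelated sentences."""
    optimizer.zero_grad()
    loss = loss_fn(transform(anchor), transform(positive), transform(negative))
    loss.backward()
    optimizer.step()
    return loss.item()

# Toy usage with random tensors standing in for real contextualized embeddings.
a, p, n = (torch.randn(32, dim) for _ in range(3))
print(training_step(a, p, n))
```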