@inproceedings{tayyar-madabushi-etal-2020-cxgbert,
    title = "{C}x{GBERT}: {BERT} meets Construction Grammar",
    author = "Tayyar Madabushi, Harish and
      Romain, Laurence and
      Divjak, Dagmar and
      Milin, Petar",
    editor = "Scott, Donia and
      Bel, Nuria and
      Zong, Chengqing",
    booktitle = "Proceedings of the 28th International Conference on Computational Linguistics",
    month = dec,
    year = "2020",
    address = "Barcelona, Spain (Online)",
    publisher = "International Committee on Computational Linguistics",
    url = "https://aclanthology.org/2020.coling-main.355",
    doi = "10.18653/v1/2020.coling-main.355",
    pages = "4020--4032",
    abstract = "While lexico-semantic elements no doubt capture a large amount of linguistic information, it has been argued that they do not capture all information contained in text. This assumption is central to constructionist approaches to language which argue that language consists of constructions, learned pairings of a form and a function or meaning that are either frequent or have a meaning that cannot be predicted from its component parts. BERT{'}s training objectives give it access to a tremendous amount of lexico-semantic information, and while BERTology has shown that BERT captures certain important linguistic dimensions, there have been no studies exploring the extent to which BERT might have access to constructional information. In this work we design several probes and conduct extensive experiments to answer this question. Our results allow us to conclude that BERT does indeed have access to a significant amount of information, much of which linguists typically call constructional information. The impact of this observation is potentially far-reaching as it provides insights into what deep learning methods learn from text, while also showing that information contained in constructions is redundantly encoded in lexico-semantics.",
}
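As a quick illustration (not part of the record itself), the entry above can be loaded programmatically. A minimal sketch, assuming the v1 API of the third-party `bibtexparser` package and a hypothetical file name `cxgbert.bib`:

```python
# Minimal sketch: load the BibTeX entry above into a Python dict.
# Assumes bibtexparser's v1 API (pip install "bibtexparser<2");
# the file name cxgbert.bib is hypothetical.
import bibtexparser

with open("cxgbert.bib") as fh:
    db = bibtexparser.load(fh)

entry = db.entries[0]   # the single entry in the file
print(entry["ID"])      # tayyar-madabushi-etal-2020-cxgbert
print(entry["title"])   # {C}x{GBERT}: {BERT} meets Construction Grammar
print(entry["pages"])   # 4020--4032
```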
<?xml version="1.0" encoding="UTF-8"?>
<modsCollection xmlns="http://www.loc.gov/mods/v3">
  <mods ID="tayyar-madabushi-etal-2020-cxgbert">
    <titleInfo>
      <title>CxGBERT: BERT meets Construction Grammar</title>
    </titleInfo>
    <name type="personal">
      <namePart type="given">Harish</namePart>
      <namePart type="family">Tayyar Madabushi</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Laurence</namePart>
      <namePart type="family">Romain</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Dagmar</namePart>
      <namePart type="family">Divjak</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <name type="personal">
      <namePart type="given">Petar</namePart>
      <namePart type="family">Milin</namePart>
      <role>
        <roleTerm authority="marcrelator" type="text">author</roleTerm>
      </role>
    </name>
    <originInfo>
      <dateIssued>2020-12</dateIssued>
    </originInfo>
    <typeOfResource>text</typeOfResource>
    <relatedItem type="host">
      <titleInfo>
        <title>Proceedings of the 28th International Conference on Computational Linguistics</title>
      </titleInfo>
      <name type="personal">
        <namePart type="given">Donia</namePart>
        <namePart type="family">Scott</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Nuria</namePart>
        <namePart type="family">Bel</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <name type="personal">
        <namePart type="given">Chengqing</namePart>
        <namePart type="family">Zong</namePart>
        <role>
          <roleTerm authority="marcrelator" type="text">editor</roleTerm>
        </role>
      </name>
      <originInfo>
        <publisher>International Committee on Computational Linguistics</publisher>
        <place>
          <placeTerm type="text">Barcelona, Spain (Online)</placeTerm>
        </place>
      </originInfo>
      <genre authority="marcgt">conference publication</genre>
    </relatedItem>
    <abstract>While lexico-semantic elements no doubt capture a large amount of linguistic information, it has been argued that they do not capture all information contained in text. This assumption is central to constructionist approaches to language which argue that language consists of constructions, learned pairings of a form and a function or meaning that are either frequent or have a meaning that cannot be predicted from its component parts. BERT’s training objectives give it access to a tremendous amount of lexico-semantic information, and while BERTology has shown that BERT captures certain important linguistic dimensions, there have been no studies exploring the extent to which BERT might have access to constructional information. In this work we design several probes and conduct extensive experiments to answer this question. Our results allow us to conclude that BERT does indeed have access to a significant amount of information, much of which linguists typically call constructional information. The impact of this observation is potentially far-reaching as it provides insights into what deep learning methods learn from text, while also showing that information contained in constructions is redundantly encoded in lexico-semantics.</abstract>
    <identifier type="citekey">tayyar-madabushi-etal-2020-cxgbert</identifier>
    <identifier type="doi">10.18653/v1/2020.coling-main.355</identifier>
    <location>
      <url>https://aclanthology.org/2020.coling-main.355</url>
    </location>
    <part>
      <date>2020-12</date>
      <extent unit="page">
        <start>4020</start>
        <end>4032</end>
      </extent>
    </part>
  </mods>
</modsCollection>
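The MODS record is plain XML, so it can be read with the Python standard library alone. A minimal sketch, assuming the record above is saved under the hypothetical name `cxgbert.xml`:

```python
# Minimal sketch: extract title and authors from the MODS record above
# using only the standard library. The file name cxgbert.xml is hypothetical.
import xml.etree.ElementTree as ET

NS = {"m": "http://www.loc.gov/mods/v3"}
mods = ET.parse("cxgbert.xml").getroot().find("m:mods", NS)

title = mods.findtext("m:titleInfo/m:title", namespaces=NS)
authors = [
    " ".join(p.text for p in name.findall("m:namePart", NS))
    for name in mods.findall("m:name", NS)  # direct children only; editors sit under relatedItem
    if name.findtext("m:role/m:roleTerm", namespaces=NS) == "author"
]

print(title)               # CxGBERT: BERT meets Construction Grammar
print("; ".join(authors))  # Harish Tayyar Madabushi; Laurence Romain; ...
```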
%0 Conference Proceedings
%T CxGBERT: BERT meets Construction Grammar
%A Tayyar Madabushi, Harish
%A Romain, Laurence
%A Divjak, Dagmar
%A Milin, Petar
%Y Scott, Donia
%Y Bel, Nuria
%Y Zong, Chengqing
%S Proceedings of the 28th International Conference on Computational Linguistics
%D 2020
%8 December
%I International Committee on Computational Linguistics
%C Barcelona, Spain (Online)
%F tayyar-madabushi-etal-2020-cxgbert
%X While lexico-semantic elements no doubt capture a large amount of linguistic information, it has been argued that they do not capture all information contained in text. This assumption is central to constructionist approaches to language which argue that language consists of constructions, learned pairings of a form and a function or meaning that are either frequent or have a meaning that cannot be predicted from its component parts. BERT’s training objectives give it access to a tremendous amount of lexico-semantic information, and while BERTology has shown that BERT captures certain important linguistic dimensions, there have been no studies exploring the extent to which BERT might have access to constructional information. In this work we design several probes and conduct extensive experiments to answer this question. Our results allow us to conclude that BERT does indeed have access to a significant amount of information, much of which linguists typically call constructional information. The impact of this observation is potentially far-reaching as it provides insights into what deep learning methods learn from text, while also showing that information contained in constructions is redundantly encoded in lexico-semantics.
%R 10.18653/v1/2020.coling-main.355
%U https://aclanthology.org/2020.coling-main.355
%U https://doi.org/10.18653/v1/2020.coling-main.355
%P 4020-4032
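The Endnote (refer-style) record above uses two-character `%` tags, one field per line, with repeatable tags such as `%A` and `%U`. A minimal stdlib sketch, again with a hypothetical file name:

```python
# Minimal sketch: read the refer/Endnote record above into tag -> value lists.
# Repeatable tags (%A authors, %Y editors, %U URLs) accumulate in order.
# The file name cxgbert.enw is hypothetical.
from collections import defaultdict

fields = defaultdict(list)
with open("cxgbert.enw") as fh:
    for line in fh:
        if line.startswith("%"):
            tag, _, value = line.partition(" ")
            fields[tag].append(value.strip())

print(fields["%T"][0])          # CxGBERT: BERT meets Construction Grammar
print(", ".join(fields["%A"]))  # authors in citation order
print(fields["%P"][0])          # 4020-4032
```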
Markdown (Informal)
[CxGBERT: BERT meets Construction Grammar](https://aclanthology.org/2020.coling-main.355) (Tayyar Madabushi et al., COLING 2020)

ACL

Harish Tayyar Madabushi, Laurence Romain, Dagmar Divjak, and Petar Milin. 2020. CxGBERT: BERT meets Construction Grammar. In Proceedings of the 28th International Conference on Computational Linguistics, pages 4020–4032, Barcelona, Spain (Online). International Committee on Computational Linguistics.