Probing Relational Knowledge in Language Models via Word Analogies

Kiamehr Rezaee, Jose Camacho-Collados


Abstract
Understanding relational knowledge plays an integral part in natural language comprehension. When it comes to pre-trained language models (PLMs), prior work has focused on probing relational knowledge by filling the blanks in pre-defined prompts such as “The capital of France is ___”. However, these probes may be affected by the co-occurrence of target relation words and entities (e.g. “capital”, “France” and “Paris”) in the pre-training corpus. In this work, we extend these probing methodologies by leveraging analogical proportions as a proxy to probe relational knowledge in transformer-based PLMs without directly presenting the desired relation. In particular, we analysed the ability of PLMs to understand (1) the directionality of a given relation (e.g. Paris-France is not the same as France-Paris); (2) the types of the entities in a given relation (both France and Japan are countries); and (3) the relation itself (Paris, but not Rome, is the capital of France). Our results show that PLMs are extremely accurate at (1) and (2), but have clear room for improvement on (3). To better understand the reasons behind this behaviour and the mistakes made by PLMs, we provide an extended quantitative analysis based on relevant factors such as frequency.
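
As an illustration only, the three aspects listed in the abstract can be pictured as three kinds of distractor for a gold analogy such as Paris : France :: Tokyo : Japan. The sketch below is not the authors' procedure; the `Pair` class, the `build_candidates` function, and the example entities "Kyoto" and "Rome" used as distractors are all assumptions made for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pair:
    """A word pair (head : tail), e.g. Tokyo : Japan. Hypothetical helper type."""
    head: str
    tail: str


def build_candidates(gold: Pair, wrong_type_tail: str, wrong_relation_head: str) -> dict:
    """Build one gold candidate plus three illustrative distractors.

    All distractor choices here are assumptions for illustration:
      - "direction": the gold pair with head and tail swapped.
      - "type": the tail replaced by an entity of the wrong type.
      - "relation": correct entity types, but the relation does not hold.
    """
    return {
        "gold": gold,
        "direction": Pair(gold.tail, gold.head),
        "type": Pair(gold.head, wrong_type_tail),
        "relation": Pair(wrong_relation_head, gold.tail),
    }


# Query pair is Paris : France; the gold answer pair is Tokyo : Japan.
candidates = build_candidates(
    Pair("Tokyo", "Japan"),
    wrong_type_tail="Kyoto",      # a city where a country is expected
    wrong_relation_head="Rome",   # a capital, but not the capital of Japan
)

for label, pair in candidates.items():
    print(f"{label:>9}: Paris : France :: {pair.head} : {pair.tail}")
```

Under this reading, a model that prefers the gold candidate over the "direction" and "type" distractors but not over the "relation" distractor would show the pattern the abstract reports.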
Anthology ID:
2022.findings-emnlp.289
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2022
Month:
December
Year:
2022
Address:
Abu Dhabi, United Arab Emirates
Editors:
Yoav Goldberg, Zornitsa Kozareva, Yue Zhang
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
3930–3936
URL:
https://aclanthology.org/2022.findings-emnlp.289
DOI:
10.18653/v1/2022.findings-emnlp.289
Cite (ACL):
Kiamehr Rezaee and Jose Camacho-Collados. 2022. Probing Relational Knowledge in Language Models via Word Analogies. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 3930–3936, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
Cite (Informal):
Probing Relational Knowledge in Language Models via Word Analogies (Rezaee & Camacho-Collados, Findings 2022)
PDF:
https://aclanthology.org/2022.findings-emnlp.289.pdf