Methods for Measuring, Updating, and Visualizing Factual Beliefs in Language Models

Peter Hase; Mona Diab; Asli Celikyilmaz; Xian Li; Zornitsa Kozareva; Veselin Stoyanov; Mohit Bansal; Srinivasan Iyer

doi:10.18653/v1/2023.eacl-main.199

Methods for Measuring, Updating, and Visualizing Factual Beliefs in Language Models

Peter Hase, Mona Diab, Asli Celikyilmaz, Xian Li, Zornitsa Kozareva, Veselin Stoyanov, Mohit Bansal, Srinivasan Iyer

Abstract

Language models can memorize a considerable amount of factual information during pretraining that can be elicited through prompting or finetuning models on tasks like question answering. In this paper, we discuss approaches to measuring model factual beliefs, updating incorrect factual beliefs in models, and visualizing graphical relationships between factual beliefs. Our main contributions include: (1) new metrics for evaluating belief-updating methods focusing on the logical consistency of beliefs, (2) a training objective for Sequential, Local, and Generalizing updates (SLAG) that improves the performance of existing hypernetwork approaches, and (3) the introduction of the belief graph, a new form of visualization for language models that shows relationships between stored model beliefs. Our experiments suggest that models show only limited consistency between factual beliefs, but update methods can both fix incorrect model beliefs and greatly improve their consistency. Although off-the-shelf optimizers are surprisingly strong belief-updating baselines, our learned optimizers can outperform them in more difficult settings than have been considered in past work.

Anthology ID:: 2023.eacl-main.199
Volume:: Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics
Month:: May
Year:: 2023
Address:: Dubrovnik, Croatia
Editors:: Andreas Vlachos, Isabelle Augenstein
Venue:: EACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2714–2731
Language:
URL:: https://aclanthology.org/2023.eacl-main.199/
DOI:: 10.18653/v1/2023.eacl-main.199
Bibkey:
Cite (ACL):: Peter Hase, Mona Diab, Asli Celikyilmaz, Xian Li, Zornitsa Kozareva, Veselin Stoyanov, Mohit Bansal, and Srinivasan Iyer. 2023. Methods for Measuring, Updating, and Visualizing Factual Beliefs in Language Models. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics, pages 2714–2731, Dubrovnik, Croatia. Association for Computational Linguistics.
Cite (Informal):: Methods for Measuring, Updating, and Visualizing Factual Beliefs in Language Models (Hase et al., EACL 2023)
Copy Citation:
PDF:: https://aclanthology.org/2023.eacl-main.199.pdf
Video:: https://aclanthology.org/2023.eacl-main.199.mp4

PDF Cite Search Video Fix data