Daniel Mela


2024

Mass-Editing Memory with Attention in Transformers: A cross-lingual exploration of knowledge
Daniel Mela | Aitor Gonzalez-Agirre | Javier Hernando | Marta Villegas
Findings of the Association for Computational Linguistics: ACL 2024

Recent research has explored methods for updating and modifying factual knowledge in large language models, often focusing on specific multi-layer perceptron blocks. This study expands on this work by examining the effectiveness of existing knowledge editing methods across languages and delving into the role of attention mechanisms in this process. Drawing from the insights gained, we propose Mass-Editing Memory with Attention in Transformers (MEMAT), a method that achieves significant improvements in all metrics while requiring minimal parameter modifications. MEMAT delivers a remarkable 10% increase in magnitude metrics, benefits languages not included in the training data, and demonstrates a high degree of portability. Our code and data are at https://github.com/dtamayo-nlp/MEMAT.