Contextualization Distillation from Large Language Model for Knowledge Graph Completion

Dawei Li, Zhen Tan, Tianlong Chen, Huan Liu


Abstract
While textual information significantly enhances the performance of pre-trained language models (PLMs) in knowledge graph completion (KGC), the static and noisy nature of existing corpora collected from Wikipedia articles or synset definitions often limits the potential of PLM-based KGC models. To surmount these challenges, we introduce the Contextualization Distillation strategy, a versatile plug-and-play approach compatible with both discriminative and generative KGC frameworks. Our method begins by instructing large language models (LLMs) to transform compact, structural triplets into context-rich segments. Subsequently, we introduce two tailored auxiliary tasks—reconstruction and contextualization—allowing smaller KGC models to assimilate insights from these enriched triplets. Comprehensive evaluations across diverse datasets and KGC techniques highlight the efficacy and adaptability of our approach, revealing consistent performance enhancements irrespective of underlying pipelines or architectures. Moreover, our analysis makes our method more explainable and provides insight into how to generate high-quality corpora for KGC, as well as how to select suitable distillation tasks.
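To make the two-stage idea in the abstract concrete, below is a minimal sketch of how one might (a) prompt an LLM to turn a compact triplet into a context-rich passage and (b) derive the two auxiliary training examples (reconstruction and contextualization) for a smaller KGC model. The helper names (build_context_prompt, make_reconstruction_example, make_contextualization_example) and the example triplet are illustrative assumptions, not the authors' actual implementation; the LLM call itself is left as a placeholder.

def build_context_prompt(head: str, relation: str, tail: str) -> str:
    """Instruct an LLM to rewrite a compact triplet as a descriptive passage."""
    return (
        "Write a short descriptive paragraph that explains the fact "
        f"({head}, {relation}, {tail}) in natural language, including relevant context."
    )

def make_reconstruction_example(passage: str, head: str, tail: str) -> dict:
    """Auxiliary reconstruction task (assumed form): mask entity mentions in the
    LLM-generated passage and train the smaller KGC model to recover the passage."""
    corrupted = passage.replace(head, "[MASK]").replace(tail, "[MASK]")
    return {"input": corrupted, "target": passage}

def make_contextualization_example(head: str, relation: str, tail: str, passage: str) -> dict:
    """Auxiliary contextualization task (assumed form): map the raw triplet to the
    context-rich passage produced by the LLM."""
    return {"input": f"{head} | {relation} | {tail}", "target": passage}

if __name__ == "__main__":
    triplet = ("Marie Curie", "award received", "Nobel Prize in Physics")
    prompt = build_context_prompt(*triplet)
    # passage = generate(prompt)  # placeholder: call an instruction-following LLM here
    passage = ("Marie Curie received the Nobel Prize in Physics in 1903 "
               "for her pioneering research on radiation.")
    print(make_reconstruction_example(passage, triplet[0], triplet[2]))
    print(make_contextualization_example(*triplet, passage))

In this sketch, the smaller KGC model would be fine-tuned on both auxiliary examples alongside its standard completion objective; the exact prompt wording, masking scheme, and loss weighting used in the paper are detailed in the PDF linked below.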
Anthology ID:
2024.findings-eacl.32
Original:
2024.findings-eacl.32v1
Version 2:
2024.findings-eacl.32v2
Volume:
Findings of the Association for Computational Linguistics: EACL 2024
Month:
March
Year:
2024
Address:
St. Julian’s, Malta
Editors:
Yvette Graham, Matthew Purver
Venue:
Findings
Publisher:
Association for Computational Linguistics
Pages:
458–477
URL:
https://aclanthology.org/2024.findings-eacl.32
Cite (ACL):
Dawei Li, Zhen Tan, Tianlong Chen, and Huan Liu. 2024. Contextualization Distillation from Large Language Model for Knowledge Graph Completion. In Findings of the Association for Computational Linguistics: EACL 2024, pages 458–477, St. Julian’s, Malta. Association for Computational Linguistics.
Cite (Informal):
Contextualization Distillation from Large Language Model for Knowledge Graph Completion (Li et al., Findings 2024)
PDF:
https://aclanthology.org/2024.findings-eacl.32.pdf