Compressing LLM Knowledge into Graph Representations for Text-attributed Graphs Learning

Runhuai Chen; Dian Shen; Dandan Zhang; Kaihong Huang; Linghui Meng; Beilun Wang

Abstract

Text-attributed graphs (TAGs) require jointly modeling relational structure and node-level text. Existing GNN-LLM approaches invoke large language models at inference time to process the text attributes, which makes deployment costly. More fundamentally, LLM knowledge is typically used in a sample-wise manner, leading to inefficient utilization across graph instances. In this work, we study how interactions with LLM embedding spaces affect graph representations, and show that projecting into the LLM space helps learn better GNNs. In other words, the knowledge encoded in LLM embeddings can be compressed into graph representations. Based on this insight, we propose a framework that internalizes LLM knowledge within graph models and supports inference-efficient TAG learning. Our framework employs a hierarchical Proxy-Purifier module with distribution-level regularization, using LLM embeddings only as training-time guidance. With this module, the model processes TAGs without invoking LLMs at inference time, matching the efficiency of standard GNNs. Experiments on five popular TAG tasks further show that our method achieves consistent performance gains over existing GNN-LLM approaches.

Anthology ID: 2026.acl-long.1398
Volume: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 30303–30318
URL: https://aclanthology.org/2026.acl-long.1398/
Cite (ACL): Runhuai Chen, Dian Shen, Dandan Zhang, Kaihong Huang, Linghui Meng, and Beilun Wang. 2026. Compressing LLM Knowledge into Graph Representations for Text-attributed Graphs Learning. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 30303–30318, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): Compressing LLM Knowledge into Graph Representations for Text-attributed Graphs Learning (Chen et al., ACL 2026)
PDF: https://aclanthology.org/2026.acl-long.1398.pdf
Checklist: 2026.acl-long.1398.checklist.pdf
