Query-Driven Multimodal GraphRAG: Dynamic Local Knowledge Graph Construction for Online Reasoning

Chenyang Bu; Guojie Chang; Zihao Chen; CunYuan Dang; Zhize Wu; Yi He; Xindong Wu

doi:10.18653/v1/2025.findings-acl.1100

Query-Driven Multimodal GraphRAG: Dynamic Local Knowledge Graph Construction for Online Reasoning

Chenyang Bu, Guojie Chang, Zihao Chen, CunYuan Dang, Zhize Wu, Yi He, Xindong Wu

Abstract

An increasing adoption of Large Language Models (LLMs) in complex reasoning tasks necessitates their interpretability and reliability. Recent advances to that end include retrieval-augmented generation (RAG) and knowledge graph-enhanced RAG (GraphRAG), whereas they are constrained by static knowledge bases and ineffective multimodal data integration. In response, we propose a Query-Driven Multimodal GraphRAG framework that dynamically constructs local knowledge graphs tailored to query semantics. Our approach 1) derives graph patterns from query semantics to guide knowledge extraction, 2) employs a multi-path retrieval strategy to pinpoint core knowledge, and 3) supplements missing multimodal information ad hoc. Experimental results on the MultimodalQA and WebQA datasets demonstrate that our framework achieves the state-of-the-art performance among unsupervised competitors, particularly excelling in cross-modal understanding of complex queries.

Anthology ID:: 2025.findings-acl.1100
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 21360–21380
Language:
URL:: https://aclanthology.org/2025.findings-acl.1100/
DOI:: 10.18653/v1/2025.findings-acl.1100
Bibkey:
Cite (ACL):: Chenyang Bu, Guojie Chang, Zihao Chen, CunYuan Dang, Zhize Wu, Yi He, and Xindong Wu. 2025. Query-Driven Multimodal GraphRAG: Dynamic Local Knowledge Graph Construction for Online Reasoning. In Findings of the Association for Computational Linguistics: ACL 2025, pages 21360–21380, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: Query-Driven Multimodal GraphRAG: Dynamic Local Knowledge Graph Construction for Online Reasoning (Bu et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-acl.1100.pdf

PDF Cite Search Fix data