Generating Domain-Specific Knowledge Graphs from Large Language Models

Marinela Parović; Ze Li; Jinhua Du

doi:10.18653/v1/2025.findings-acl.602

Generating Domain-Specific Knowledge Graphs from Large Language Models

Abstract

Knowledge graphs (KGs) have been a cornerstone of search and recommendation due to their ability to store factual knowledge about any domain in a structured form enabling easy search and retrieval. Large language models (LLMs) have shown impressive world knowledge across different benchmarks and domains but their knowledge is inconveniently scattered across their billions of parameters. In this paper, we propose a prompt-based method to construct domain-specific KGs by extracting knowledge solely from LLMs’ parameters. First, we use an LLM to create a schema for a specific domain, which contains a set of domain-representative entities and relations. After that, we use the schema to guide the LLM through an iterative data generation process equipped with Chain-of-Verification (CoVe) for increased data quality. Using this method, we construct KGs for two domains: books and landmarks, which we then evaluate against Wikidata, an open-source human-created KG. Our results show that LLMs can generate large domain-specific KGs containing tens of thousands of entities and relations. However, due to the increased hallucination rates as the procedure evolves, the utility of large-scale LLM-generated KGs in practical applications could remain limited.

Anthology ID:: 2025.findings-acl.602
Volume:: Findings of the Association for Computational Linguistics: ACL 2025
Month:: July
Year:: 2025
Address:: Vienna, Austria
Editors:: Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 11558–11574
Language:
URL:: https://aclanthology.org/2025.findings-acl.602/
DOI:: 10.18653/v1/2025.findings-acl.602
Bibkey:
Cite (ACL):: Marinela Parović, Ze Li, and Jinhua Du. 2025. Generating Domain-Specific Knowledge Graphs from Large Language Models. In Findings of the Association for Computational Linguistics: ACL 2025, pages 11558–11574, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):: Generating Domain-Specific Knowledge Graphs from Large Language Models (Parović et al., Findings 2025)
Copy Citation:
PDF:: https://aclanthology.org/2025.findings-acl.602.pdf

PDF Cite Search Fix data