LeakDojo: Decoding the Leakage Threats of RAG Systems

Maosen Zhang; Jianshuo Dong; Lu Boting; Li Wenyue; Xiaoping Zhang; Tianwei Zhang; Han Qiu

LeakDojo: Decoding the Leakage Threats of RAG Systems

Maosen Zhang, Jianshuo Dong, Lu Boting, Li Wenyue, Xiaoping Zhang, Tianwei Zhang, Han Qiu

Abstract

Retrieval-Augmented Generation (RAG) enables large language models (LLMs) to leverage external knowledge, but also exposes valuable RAG databases to leakage attacks. As RAG systems grow more complex and LLMs exhibit stronger instruction-following capabilities, existing studies fall short of systematically assessing RAG leakage risks. We present LeakDojo, a configurable framework for controlled evaluation of RAG leakage. Using LeakDojo, we benchmark six existing attacks across fourteen LLMs, four datasets, and diverse RAG systems. Our study reveals that (1) query generation and adversarial instructions contribute independently to leakage, with overall leakage well approximated by their product; (2) stronger instruction-following capability correlates with higher leakage risk; and (3) improvements in RAG faithfulness can introduce increased leakage risk. These findings provide actionable insights for understanding and mitigating RAG leakage in practice. Our codebase is available at https://github.com/yeasen-z/LeakDojo.

Anthology ID:: 2026.findings-acl.287
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 5790–5811
Language:
URL:: https://aclanthology.org/2026.findings-acl.287/
DOI:
Bibkey:
Cite (ACL):: Maosen Zhang, Jianshuo Dong, Lu Boting, Li Wenyue, Xiaoping Zhang, Tianwei Zhang, and Han Qiu. 2026. LeakDojo: Decoding the Leakage Threats of RAG Systems. In Findings of the Association for Computational Linguistics: ACL 2026, pages 5790–5811, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: LeakDojo: Decoding the Leakage Threats of RAG Systems (Zhang et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-acl.287.pdf
Checklist:: 2026.findings-acl.287.checklist.pdf

PDF Cite Search Checklist Fix data