Fingerprinting LLMs via Prompt Injection

Yuepeng Hu; Zhengyuan Jiang; Mengyuan Li; Osama Ahmed; Zhicong Huang; Cheng Hong; Neil Zhenqiang Gong

doi:10.18653/v1/2026.acl-long.541

Abstract

Large language models (LLMs) are often modified after release through post-processing such as post-training or quantization, which makes it challenging to determine whether one model is derived from another. Existing provenance detection methods have two main limitations: (1) they embed signals into the base model before release, which is infeasible for already published models, or (2) they compare outputs across models using hand-crafted or random prompts, which are not robust to post-processing. In this work, we propose LLMPrint, a novel detection framework that constructs fingerprints by exploiting LLMs’ inherent vulnerability to prompt injection. Our key insight is that by optimizing fingerprint prompts to enforce consistent token preferences, we can obtain fingerprints that are both unique to the base model and robust to post-processing. We further develop a unified verification procedure that applies to both gray-box and black-box settings, with statistical guarantees. We evaluate LLMPrint on five base models and around 700 post-trained or quantized variants. Our results show that LLMPrint achieves high true positive rates while keeping false positive rates near zero. The code is publicly available at https://github.com/hifi-hyp/ACL-LLMPrint.

Anthology ID: 2026.acl-long.541
Volume: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 11795–11810
URL: https://aclanthology.org/2026.acl-long.541/
DOI: 10.18653/v1/2026.acl-long.541
Cite (ACL): Yuepeng Hu, Zhengyuan Jiang, Mengyuan Li, Osama Ahmed, Zhicong Huang, Cheng Hong, and Neil Zhenqiang Gong. 2026. Fingerprinting LLMs via Prompt Injection. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 11795–11810, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): Fingerprinting LLMs via Prompt Injection (Hu et al., ACL 2026)
PDF: https://aclanthology.org/2026.acl-long.541.pdf
Checklist: 2026.acl-long.541.checklist.pdf