Instructional Fingerprinting of Large Language Models

Jiashu Xu; Fei Wang; Mingyu Ma; Pang Wei Koh; Chaowei Xiao; Muhao Chen

doi:10.18653/v1/2024.naacl-long.180

Instructional Fingerprinting of Large Language Models

Jiashu Xu, Fei Wang, Mingyu Ma, Pang Wei Koh, Chaowei Xiao, Muhao Chen

Abstract

The exorbitant cost of training Large language models (LLMs) from scratch makes it essential to fingerprint the models to protect intellectual property via ownership authentication and to ensure downstream users and developers comply with their license terms (eg restricting commercial use). In this study, we present a pilot study on LLM fingerprinting as a form of very lightweight instruction tuning. Model publisher specifies a confidential private key and implants it as an instruction backdoor that causes the LLM to generate specific text when the key is present. Results on 11 popularly-used LLMs showed that this approach is lightweight and does not affect the normal behavior of the model. It also prevents publisher overclaim, maintains robustness against fingerprint guessing and parameter-efficient training, and supports multi-stage fingerprinting akin to MIT License.

Anthology ID:: 2024.naacl-long.180
Volume:: Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)
Month:: June
Year:: 2024
Address:: Mexico City, Mexico
Editors:: Kevin Duh, Helena Gomez, Steven Bethard
Venue:: NAACL
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 3277–3306
Language:
URL:: https://aclanthology.org/2024.naacl-long.180
DOI:: 10.18653/v1/2024.naacl-long.180
Bibkey:
Cite (ACL):: Jiashu Xu, Fei Wang, Mingyu Ma, Pang Wei Koh, Chaowei Xiao, and Muhao Chen. 2024. Instructional Fingerprinting of Large Language Models. In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), pages 3277–3306, Mexico City, Mexico. Association for Computational Linguistics.
Cite (Informal):: Instructional Fingerprinting of Large Language Models (Xu et al., NAACL 2024)
Copy Citation:
PDF:: https://aclanthology.org/2024.naacl-long.180.pdf

PDF Cite Search