SpARK: An Embarrassingly Simple Sparse Watermarking in LLMs with Enhanced Text Quality

Duy Cao Hoang; Thanh Quoc Hung Le; Rui Chu; Ping Li; Weijie Zhao; Yingjie Lao; Khoa D Doan

SpARK: An Embarrassingly Simple Sparse Watermarking in LLMs with Enhanced Text Quality

Duy Cao Hoang, Thanh Quoc Hung Le, Rui Chu, Ping Li, Weijie Zhao, Yingjie Lao, Khoa D Doan

Abstract

With the widespread adoption of Large Language Models (LLMs), concerns about potential misuse have emerged. To this end, watermarking has been adapted to LLM, enabling a simple and effective way to detect and monitor generated text. However, while the existing methods can differentiate between watermarked and unwatermarked text with high accuracy, they often face a trade-off between the quality of the generated text and the effectiveness of the watermarking process. In this work, we present a novel type of LLM watermark, Sparse WatermARK (or SpARK), which aims to mitigate this trade-off by applying watermarks to a small subset of generated tokens distributed across the text. To demonstrate this type of watermark, we introduce two novel variants, SpARK-P and SpARK-R, which achieve sparsity by anchoring watermarked tokens to words that have specific Part-of-Speech (POS) tags and specific hash values w.r.t a pseudorandom hash function, respectively. Our experimental results demonstrate that the proposed watermarking schemes, albeit embarrassingly simple, are incredibly effective, achieving high detectability while generating text that outperforms previous LLM watermarking methods in quality across various tasks. SpARK further advances the watermarking capability for LLMs while maintaining their generated text quality.

Anthology ID:: 2026.findings-eacl.240
Volume:: Findings of the Association for Computational Linguistics: EACL 2026
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Editors:: Vera Demberg, Kentaro Inui, Lluís Marquez
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 4603–4626
Language:
URL:: https://aclanthology.org/2026.findings-eacl.240/
DOI:
Bibkey:
Cite (ACL):: Duy Cao Hoang, Thanh Quoc Hung Le, Rui Chu, Ping Li, Weijie Zhao, Yingjie Lao, and Khoa D Doan. 2026. SpARK: An Embarrassingly Simple Sparse Watermarking in LLMs with Enhanced Text Quality. In Findings of the Association for Computational Linguistics: EACL 2026, pages 4603–4626, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: SpARK: An Embarrassingly Simple Sparse Watermarking in LLMs with Enhanced Text Quality (Hoang et al., Findings 2026)
Copy Citation:
PDF:: https://aclanthology.org/2026.findings-eacl.240.pdf
Checklist:: 2026.findings-eacl.240.checklist.pdf

PDF Cite Search Checklist Fix data