Watermarking for Large Language Models

Xuandong Zhao, Yu-Xiang Wang, Lei Li


Abstract
As AI-generated text increasingly resembles human-written content, the ability to detect machine-generated text becomes crucial in both the computational linguistics and machine learning communities. In this tutorial, we aim to provide an in-depth exploration of text watermarking, a subfield of linguistic steganography with the goal of embedding a hidden message (the watermark) within a text passage. We will introduce the fundamentals of text watermarking, discuss the main challenges in identifying AI-generated text, and delve into the current watermarking methods, assessing their strengths and weaknesses. Moreover, we will explore other possible applications of text watermarking and discuss future directions for this field. Each section will be supplemented with examples and key takeaways.
Anthology ID:
2024.luhme-tutorials.6
Volume:
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 5: Tutorial Abstracts)
Month:
August
Year:
2024
Address:
Bangkok, Thailand
Editors:
Luis Chiruzzo, Hung-yi Lee, Leonardo Ribeiro
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
10–11
Language:
URL:
https://aclanthology.org/2024.luhme-tutorials.6/
DOI:
10.18653/v1/2024.acl-tutorials.6
Bibkey:
Cite (ACL):
Xuandong Zhao, Yu-Xiang Wang, and Lei Li. 2024. Watermarking for Large Language Models. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 5: Tutorial Abstracts), pages 10–11, Bangkok, Thailand. Association for Computational Linguistics.
Cite (Informal):
Watermarking for Large Language Models (Zhao et al., ACL 2024)
Copy Citation:
PDF:
https://aclanthology.org/2024.acl-tutorials.6.pdf