Distortion-free Watermarks are not Truly Distortion-free under Watermark Key Collisions

Wu, Yihan; Chen, Ruibo; Hu, Zhengmian; Chen, Yanshuo; Guo, Junfeng; Zhang, Hongyang; Huang, Heng

Computer Science > Cryptography and Security

arXiv:2406.02603 (cs)

[Submitted on 2 Jun 2024]

Abstract: Language model (LM) watermarking techniques inject a statistical signal into LM-generated content by substituting the random sampling process with pseudo-random sampling, using watermark keys as the random seed. Among these statistical watermarking approaches, distortion-free watermarks are particularly crucial because they embed watermarks into LM-generated content without compromising generation quality. However, one notable limitation of pseudo-random sampling compared to true-random sampling is that, under the same watermark keys (i.e., key collision), the results of pseudo-random sampling are correlated. This limitation could undermine the distortion-free property. Our studies reveal that key collisions are inevitable due to the limited availability of watermark keys, and that existing distortion-free watermarks exhibit a significant distribution bias toward the original LM distribution in the presence of key collisions. Moreover, achieving a perfect distortion-free watermark is impossible, as no statistical signal can be embedded under key collisions. To reduce the distribution bias caused by key collisions, we introduce a new family of distortion-free watermarks, the beta-watermark. Experimental results support that the beta-watermark can effectively reduce the distribution bias under key collisions.
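The key-collision effect described in the abstract can be illustrated with a small sketch. It is not the paper's beta-watermark or any specific published scheme; it is a generic keyed inverse-transform sampler, written under the assumption that the watermark key and a context string are hashed into the seed. The names `keyed_sample`, `probs`, and `context` are hypothetical and chosen only for illustration. The sketch shows that when a fresh key is drawn for each sample the empirical token frequencies stay close to the LM distribution, whereas reusing the same key makes every draw identical.

```python
import hashlib
import random


def keyed_sample(probs, key, context):
    """Sample a token index by inverse-transform sampling.

    The uniform number comes from a PRNG seeded by a hash of (key, context),
    so the same (key, context) pair always yields the same token.
    This seeding scheme is an assumption made for illustration.
    """
    digest = hashlib.sha256(f"{key}|{context}".encode("utf-8")).digest()
    seed = int.from_bytes(digest[:8], "big")
    u = random.Random(seed).random()  # pseudo-random uniform in [0, 1)
    cumulative = 0.0
    for token, p in enumerate(probs):
        cumulative += p
        if u < cumulative:
            return token
    return len(probs) - 1  # guard against floating-point round-off


# Toy next-token distribution over three tokens (hypothetical values).
probs = [0.5, 0.3, 0.2]

# Case 1: a different key for every sample (no collisions).
n_samples = 20000
counts = [0] * len(probs)
for key in range(n_samples):
    counts[keyed_sample(probs, key, "ctx")] += 1
fresh_freqs = [c / n_samples for c in counts]

# Case 2: the same key and context reused (key collision).
collided = [keyed_sample(probs, 42, "ctx") for _ in range(5)]

print("frequencies with fresh keys:", fresh_freqs)
print("draws under key collision:  ", collided)
```

In the first case the sampler is unbiased when averaged over keys, which is the sense in which such a scheme is distortion-free. In the second case the five draws are perfectly correlated, so repeated generations under a colliding key no longer behave like independent samples from the LM distribution.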

Subjects:	Cryptography and Security (cs.CR); Machine Learning (cs.LG)
Cite as:	arXiv:2406.02603 [cs.CR]
	(or arXiv:2406.02603v1 [cs.CR] for this version)
	https://doi.org/10.48550/arXiv.2406.02603

Submission history

From: Yihan Wu
[v1] Sun, 2 Jun 2024 04:07:32 UTC (520 KB)
