BAIRENSFeb 19, 2026arXiv:2602.17608

Towards Anytime-Valid Statistical Watermarking

Baihe Huang, Eric Xu, Kannan Ramchandran, Jiantao Jiao, Michael I. Jordan

AI Summary

The paper introduces Anchored E-Watermarking, a novel statistical watermarking framework for LLMs that uses e-values to enable anytime-valid inference and optimal sampling distribution selection. This framework addresses the limitations of existing watermarking methods by constructing a test supermartingale, ensuring Type-I error guarantees even with optional stopping. By leveraging an anchor distribution to approximate the target model, the method optimizes the e-value with respect to the worst-case log-growth rate, leading to improved sample efficiency and a 13-15% reduction in the token budget needed for detection compared to baselines.

Key Contribution

Achieve 13-15% more efficient LLM watermark detection by using e-values for anytime-valid inference, enabling early stopping without sacrificing statistical guarantees.

Abstract

The proliferation of Large Language Models (LLMs) necessitates efficient mechanisms to distinguish machine-generated content from human text. While statistical watermarking has emerged as a promising solution, existing methods suffer from two critical limitations: the lack of a principled approach for selecting sampling distributions and the reliance on fixed-horizon hypothesis testing, which precludes valid early stopping. In this paper, we bridge this gap by developing the first e-value-based watermarking framework, Anchored E-Watermarking, that unifies optimal sampling with anytime-valid inference. Unlike traditional approaches where optional stopping invalidates Type-I error guarantees, our framework enables valid, anytime-inference by constructing a test supermartingale for the detection process. By leveraging an anchor distribution to approximate the target model, we characterize the optimal e-value with respect to the worst-case log-growth rate and derive the optimal expected stopping time. Our theoretical claims are substantiated by simulations and evaluations on established benchmarks, showing that our framework can significantly enhance sample efficiency, reducing the average token budget required for detection by 13-15% relative to state-of-the-art baselines.

Eval Frameworks & Benchmarks Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Towards Anytime-Valid Statistical Watermarking

Related Papers