Search papers, labs, and topics across Lattice.
This paper introduces a refined detection mechanism for Aaronson's Gumbel watermarking scheme, improving its detection capabilities. The proposed detector is proven near-optimal within the class of model-agnostic detectors under the i.i.d. next-token distribution assumption. This advancement enhances the reliability and robustness of Gumbel watermarks for identifying AI-generated text.
Gumbel watermarks just got a whole lot harder to evade: a new detection method is provably near-optimal.
We propose a simple detection mechanism for the Gumbel watermarking scheme proposed by Aaronson (2022). The new mechanism is proven to be near-optimal in a problem-dependent sense among all model-agnostic watermarking schemes under the assumption that the next-token distribution is sampled i.i.d.