Jun 1, 2026 | arXiv:2606.02158

On the Salience of Low-Probability Tokens for AI-Generated Text Detection: A Multiscale Uncertainty Perspective

Yikai Guo, Bin Wang, Xilai Fan, Wenjun Ke, Haoran Luo

AI Summary

This paper addresses the challenge of detecting AI-generated text by introducing a multiscale uncertainty estimator, called Uncertainty, that focuses on low-probability tokens, which expose distributional discrepancies between human and AI writing. Averaging the log-probabilities of these tokens mitigates boilerplate dominance, while Rényi entropy over the same low-probability region stabilizes decisions under adversarial manipulation. Experiments across seven datasets and sixteen LLMs show high effectiveness, generalization, and robustness.

Key Contribution

Low-probability tokens can be the key to distinguishing AI-generated text from human writing, revealing hidden distributional discrepancies that traditional methods overlook.

Abstract

AI-generated text increasingly blends with human writing, raising practical risks such as misinformation, academic misuse, and corpus contamination. While statistical detectors are appealing for their efficiency and generalization, they suffer from two key limitations: (i) boilerplate dominance, where boilerplate tokens shared across human and LLM writing overwhelm discriminative signals; and (ii) brittle point estimates, where relying on a single probability score yields unstable decisions under adversarial manipulation. To address these issues, we propose Uncertainty, a multiscale uncertainty estimator that focuses on informative low-probability tokens, which more clearly expose distributional discrepancies. Locally, it alleviates boilerplate dominance by averaging the log-probabilities of low-probability tokens; globally, it reduces brittleness by capturing the distributional shape of this low-probability region via Rényi entropy. We further extend the detector to Uncertainty++ via conditional independent sampling, yielding more stable uncertainty estimates. Experiments across seven datasets and sixteen LLMs demonstrate high effectiveness, generalization, and robustness. Our code is available at https://github.com/guoyikai2000/Uncertainty-AIGT.
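The two scales described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the `fraction` cutoff used to pick the low-probability tokens, the renormalization of those tokens' probabilities, and the Rényi order `alpha` are all assumptions made for illustration. The input is assumed to be a list of per-token log-probabilities already obtained from some scoring language model.

```python
import math

def low_prob_tokens(token_logprobs, fraction=0.2):
    """Return the lowest-probability share of the token log-probs.

    `fraction` is a hypothetical cutoff; the paper may select the
    low-probability region differently.
    """
    k = max(1, int(len(token_logprobs) * fraction))
    return sorted(token_logprobs)[:k]

def local_score(token_logprobs, fraction=0.2):
    """Local scale: mean log-probability over the low-probability tokens."""
    tail = low_prob_tokens(token_logprobs, fraction)
    return sum(tail) / len(tail)

def renyi_entropy(probs, alpha=2.0):
    """Renyi entropy of order alpha (alpha != 1) of the normalized weights."""
    if alpha == 1.0:
        raise ValueError("alpha = 1 is the Shannon limit; use alpha != 1 here")
    total = sum(probs)
    normalized = [p / total for p in probs]
    return math.log(sum(q ** alpha for q in normalized)) / (1.0 - alpha)

def global_score(token_logprobs, fraction=0.2, alpha=2.0):
    """Global scale: shape of the low-probability region via Renyi entropy.

    Renormalizing the selected token probabilities into a distribution
    is an assumption of this sketch.
    """
    tail = low_prob_tokens(token_logprobs, fraction)
    return renyi_entropy([math.exp(lp) for lp in tail], alpha)

# Toy input: four token probabilities, keep the lowest half.
logprobs = [math.log(p) for p in (0.5, 0.25, 0.25, 0.9)]
print(local_score(logprobs, fraction=0.5))
print(global_score(logprobs, fraction=0.5, alpha=2.0))
```

In this toy input the two retained tokens both have probability 0.25, so the local score is log 0.25 and the order-2 Rényi entropy of the renormalized pair is log 2. How the two scores are combined into a detection decision is not specified in the abstract and is left out here.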

Data Curation & Synthetic Data; Natural Language Processing; Red-Teaming & Adversarial Robustness

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
