Feb 23, 2026arXiv:2602.19619

Is Your Diffusion Sampler Actually Correct? A Sampler-Centric Evaluation of Discrete Diffusion Language Models

Luhan Tang, Longxuan Yu, Shaorong Zhang, Greg Ver Steeg

AI Summary

The paper investigates the correctness of sampling procedures in discrete diffusion language models (dLLMs) by disentangling denoiser approximation error from sampler-induced error. They introduce a novel sampler-centric oracle framework that uses an exact Hidden Markov Model posterior as a denoiser, derived from a ground-truth Markov chain, to isolate and measure the error introduced by the sampler itself. Their findings reveal that few-step discrete diffusion samplers exhibit distributional inaccuracies even with a perfect denoiser, and that standard evaluation metrics do not reliably indicate sampling correctness.

Key Contribution

Even with a perfect denoiser, today's fast discrete diffusion samplers still produce incorrect distributions, revealing a fundamental flaw masked by standard evaluation metrics.

Abstract

Discrete diffusion language models (dLLMs) provide a fast and flexible alternative to autoregressive models (ARMs) via iterative denoising with parallel updates. However, their evaluation is challenging: existing metrics conflate denoiser approximation error with sampler-induced error from the sampling dynamics, a problem that does not arise for ARMs whose autoregressive sampling exactly reflects the learned probability model. We introduce a sampler-centric oracle framework that replaces learned denoisers with an exact Hidden Markov Model posterior derived from a ground-truth Markov chain, isolating sampler-induced error in a controlled setting. We show that few-step discrete diffusion samplers are not distributionally correct even under an oracle denoiser, with transition-level mismatch that vanishes only as the number of steps approaches the sequence length. Moreover, improvements in negative log-likelihood, generative perplexity, or MAUVE do not imply correct sampling. Code is available at https://luhantang.github.io/dllm_sampler

Architecture Design (Transformers, SSMs, MoE)Eval Frameworks & Benchmarks Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Is Your Diffusion Sampler Actually Correct? A Sampler-Centric Evaluation of Discrete Diffusion Language Models

Related Papers