DeepMindANUDefence Science and Technology GroupMar 5, 2026arXiv:2603.04893

Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models

Sean Lamont, Christian J. Walder, Christian Walder, Paul Montague, Amir Dezfouli, Michael Norrish

AI Summary

This paper introduces a training-free, low-cost sampling intervention for Diffusion Language Models (DLMs) to enhance generative diversity. The method, called "orthogonal diffusion decoding" (ODD), sequentially modifies intermediate samples in a batch by repelling them from the feature space of previously generated samples, penalizing redundancy. Evaluated on HumanEval and GSM8K using LLaDA-8B-Instruct, ODD significantly improves diversity and Pass@$k$ performance across various temperature settings.

Key Contribution

Achieve significantly better code generation and mathematical problem solving from diffusion language models with a simple, training-free sampling tweak that encourages diversity.

Abstract

Diverse outputs in text generation are necessary for effective exploration in complex reasoning tasks, such as code generation and mathematical problem solving. Such Pass@$k$ problems benefit from distinct candidates covering the solution space. However, traditional sampling approaches often waste computational resources on repetitive failure modes. While Diffusion Language Models have emerged as a competitive alternative to the prevailing Autoregressive paradigm, they remain susceptible to this redundancy, with independent samples frequently collapsing into similar modes. To address this, we propose a training free, low cost intervention to enhance generative diversity in Diffusion Language Models. Our approach modifies intermediate samples in a batch sequentially, where each sample is repelled from the feature space of previous samples, actively penalising redundancy. Unlike prior methods that require retraining or beam search, our strategy incurs negligible computational overhead, while ensuring that each sample contributes a unique perspective to the batch. We evaluate our method on the HumanEval and GSM8K benchmarks using the LLaDA-8B-Instruct model. Our results demonstrate significantly improved diversity and Pass@$k$ performance across various temperature settings. As a simple modification to the sampling process, our method offers an immediate, low-cost improvement for current and future Diffusion Language Models in tasks that benefit from diverse solution search. We make our code available at https://github.com/sean-lamont/odd.

Code Generation & Program Synthesis Natural Language Processing Reasoning & Chain-of-Thought

Citation Metrics

Citations0

Influential citations0

References17

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Free Lunch for Pass@$k$? Low Cost Diverse Sampling for Diffusion Language Models

Related Papers