Beijing University of Posts and Telecommunications
RL fine-tuning of discrete diffusion models can be made dramatically more stable and effective by treating the final denoised sample as the action and reconstructing trajectories using the forward diffusion process.
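One way to picture "reconstructing trajectories using the forward diffusion process" is the minimal sketch below. It assumes an absorbing-state (masking) forward process, which the summary does not specify; the `MASK` id and the names `forward_mask` and `reconstruct_trajectory` are hypothetical and are not taken from the paper. Given only the final denoised sample, it draws a noised version at each chosen noise level instead of storing the intermediate states from the reverse sampling chain.

```python
import random

MASK = -1  # hypothetical id for the absorbing "mask" token (assumption)


def forward_mask(x0, t, rng):
    """Sample x_t from an assumed absorbing-state forward process:
    each token of x0 is independently replaced by MASK with probability t."""
    return [MASK if rng.random() < t else tok for tok in x0]


def reconstruct_trajectory(x0, timesteps, rng):
    """Rebuild noisy states from the final sample alone, one independent
    forward-process draw per noise level, ordered from noisiest to cleanest."""
    return [(t, forward_mask(x0, t, rng)) for t in sorted(timesteps, reverse=True)]


rng = random.Random(0)
x0 = [5, 2, 9, 7, 1, 3]  # final denoised sample, treated as the action
traj = reconstruct_trajectory(x0, [0.25, 0.5, 0.75, 1.0], rng)

for t, xt in traj:
    print(t, xt)
```

Each noise level is sampled independently here, so successive states are not guaranteed to be nested; a coupled trajectory, or a different corruption kernel such as uniform token replacement, would need a different sampler.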