Diffusion language models can achieve faster decoding and better accuracy by learning directly from the token reveal order suggested by a lightweight autoregressive teacher, without expensive distillation.