Search papers, labs, and topics across Lattice.
2
0
5
RLVR's success in long-horizon reasoning hinges on a smooth difficulty spectrum, where mastering easier sub-problems unlocks the ability to tackle harder ones, avoiding frustrating grokking plateaus.
Discrete diffusion models can now sample high-dimensional data with provable sublinear convergence rates, adapting to hidden structure without prior knowledge.