Technische Universität Berlin
SympFormer achieves faster convergence in attention blocks by drawing inspiration from inertial Nesterov acceleration, offering a potential speedup without additional computational cost.
Forget fixed schedules: this new discrete diffusion model learns when to stop, adapting computation to the complexity of each reasoning problem.