Muon, an optimizer designed for stable deep learning, provably converges even when training on noisy, heavy-tailed data, outperforming standard SGD.