Search papers, labs, and topics across Lattice.
1
0
2
Hyperball achieves a remarkable 20-30% speedup in language model pretraining, even as model sizes grow, challenging the limits of traditional optimizers.