Search papers, labs, and topics across Lattice.
1
0
2
A single normalization step turns Muon into Muon+, delivering consistent perplexity improvements in LLM pre-training.