Search papers, labs, and topics across Lattice.
2
0
4
Forget benchmarks - SuperValid's capability-aligned validation loss robustly predicts downstream LLM performance across architectures, scales, and training distributions.
Ditch SwiGLU's quadratic instability: PowLU offers a rational power function that stabilizes LLM pre-training without sacrificing performance.