Search papers, labs, and topics across Lattice.
2
0
6
A 4B model can rival the mathematical reasoning of models 30x its size, proving that clever training trumps brute force scaling.
Forget simple scaling laws: the compute-optimal number of parallel rollouts in LLM RL plateaus, revealing distinct mechanisms for easy vs. hard problems.