Forget scaling laws: this work shows you can get SOTA reasoning from sub-billion-parameter models with *less* data if you're smart about curation and resampling.