Search papers, labs, and topics across Lattice.
1
0
3
A 7B model can now outperform larger models on a challenging math exam, thanks to a novel combination of merging, curriculum learning, and reinforcement learning with verifiable rewards.