A confidence-based gating mechanism lets a 14B-parameter reward model outperform 70B-parameter models, setting a new accuracy-efficiency Pareto frontier.