Search papers, labs, and topics across Lattice.
1
0
3
6
Stop rewarding reasoning that just looks good – reward reasoning that actually *helps* the downstream model solve the task.