LLMs learn faster and perform better in decision-making tasks when rewarded for being uncertain, not just for succeeding.