Search papers, labs, and topics across Lattice.
1
0
3
LLMs can be coaxed into shorter, more accurate reasoning chains by rewarding tokens that maximize mutual information with the final answer.