Tsinghua University
LLMs can be taught to reason more comprehensively over long contexts by rewarding not just the final answer, but also the quality of the reasoning steps taken to arrive at that answer.
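
As a rough illustration of the idea, the sketch below combines an outcome reward with a process reward. It is not the paper's actual formulation: the function name `combined_reward`, the per-step scores in [0, 1], the averaging, and the `process_weight` parameter are all assumptions made for this example.

```python
from statistics import mean


def combined_reward(
    answer_correct: bool,
    step_scores: list[float],
    process_weight: float = 0.5,
) -> float:
    """Hypothetical reward: outcome term plus a weighted process term."""
    # Outcome term: 1 if the final answer is correct, else 0.
    outcome = 1.0 if answer_correct else 0.0
    # Process term: mean quality of the reasoning steps (assumed to lie in [0, 1]).
    # A response with no scored steps earns no process reward.
    process = mean(step_scores) if step_scores else 0.0
    return outcome + process_weight * process
```

Under these assumptions, a response with a wrong final answer but well-grounded reasoning steps still earns partial reward, while a correct answer reached through poorly supported steps earns less than one reached through careful reasoning.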