Search papers, labs, and topics across Lattice.
MoE Key Lab of BIPC, University of Science and Technology of China, Shanghai Innovation Institute
1
0
3
Unlock long-context reasoning in LLMs by turning agent trajectories into gold-standard QA pairs, outperforming models 8x larger on challenging reasoning tasks.