Search papers, labs, and topics across Lattice.
Beijing University of Technology
1
0
3
Forget sifting through mountains of MCTS trajectories – contrasting successful and failed reasoning paths distills 10x more effective training data for reasoning models.