Search papers, labs, and topics across Lattice.
Tsinghua University
1
0
3
The organization of post-training reasoning data into a cohesive framework reveals crucial insights that could accelerate advancements in large reasoning models.