Search papers, labs, and topics across Lattice.
School of Computing and Information Systems, University of Melbourne
1
0
3
Forget random noise – teaching models *how* to explore their reasoning process yields more reliable inference-time scaling.