The Chinese University of Hong Kong, Shenzhen Loop Area Institute
Pruning reasoning paths with a learned "STOP" token slashes compute costs and boosts accuracy in large reasoning models, outperforming existing methods.