Search papers, labs, and topics across Lattice.
1
0
3
4
Achieve significant reasoning gains in frozen LLMs (+22.4%) without retraining by adaptively routing reward model guidance at the token level during inference.