Get up to 15% better multi-turn RL performance by moving tree search from inference to the *training rollout* stage, with no optimizer changes needed.