Stop letting SFT ruin your LMMs: PRISM uses on-policy distillation to realign your model *before* RL, boosting performance by up to 6%.