Search papers, labs, and topics across Lattice.
2
0
3
3
Parameter importance isn't forever: dynamically adapting which parameters are frozen during fine-tuning significantly improves generalization and reduces forgetting in LLMs.
Achieve better token efficiency in LLM policy optimization by using a novel FiberPO objective whose Jacobian is block-diagonal over trajectories and reduces to identity on-policy.