Forget blindly chasing teacher-student disagreement in on-policy distillation: focusing on *learnable* disagreement, where the teacher nudges the student toward outputs already within the student's existing possibilities, unlocks surprisingly efficient learning.