Search papers, labs, and topics across Lattice.
1
0
2
Sparse updates in on-policy distillation can match full performance with significantly reduced training overhead, challenging conventional wisdom about dense parameter updates.