Search papers, labs, and topics across Lattice.
University of North Carolina at Charlotte
2
0
4
Unlock the Rosetta Stone for neural networks: UAV lets one model explain the inner workings of *any* other, regardless of architecture or size.
On-policy distillation can lead to catastrophic length inflation in student models, but a simple fix stabilizes training and boosts performance by 7%.