Search papers, labs, and topics across Lattice.
1
0
3
2
Tiny VLMs can punch far above their weight: a 0.5B parameter model, guided by Switch-KD, closes the gap with its 3B teacher by 3.6 points on multimodal benchmarks.