A vision-only driving model, distilled from a massive vision-language-action (VLA) teacher, not only matches but *exceeds* its teacher's performance, suggesting there is still headroom in vision-centric architectures for autonomous driving.
Get 3x faster image segmentation, plus video segmentation performance comparable to fine-tuned models, all while keeping your vision encoder frozen.
Ditch the complex trackers: a plain ViT encoder, augmented with a clever query propagation trick, delivers state-of-the-art video segmentation at 10x the speed.