Get 3x faster image segmentation and video segmentation performance comparable to fine-tuned models, all while keeping your vision encoder frozen.
Ditch the complex trackers: a plain ViT encoder, augmented with a clever query propagation trick, delivers state-of-the-art video segmentation at 10x the speed.