Princeton University
VLMs can get a 10% boost in spatial reasoning and 3D understanding by training on just 10,000 synthetic images generated automatically from task keywords.
Vero, an open-source VLM trained with RL on a diverse 600K-sample dataset, closes the performance gap with proprietary models and shows that broad task coverage, not just scale, is the key to unlocking general visual reasoning.