Search papers, labs, and topics across Lattice.
1
0
3
6
Ditch the coordinate system: VLMs can point *way* better by directly selecting visual tokens, leading to SOTA results and improved sample efficiency.