Search papers, labs, and topics across Lattice.
The University of Tokyo
2
0
5
Despite advances in VLMs, agents struggle with active perception in 3D environments, revealing a significant gap in performance compared to humans.
Ditch the energy functions: C-voting unlocks better test-time reasoning in recurrent models by simply picking the most confident trajectory.