Search papers, labs, and topics across Lattice.
The University of Tokyo
2
0
4
NavWAM turns visual foresight into executable robot actions, outperforming traditional planning methods in real-world navigation scenarios.
Despite advances in VLMs, agents struggle with active perception in 3D environments, revealing a significant gap in performance compared to humans.