Search papers, labs, and topics across Lattice.
2
52
4
4
Embodied navigation agents, already struggling, fall apart when faced with the kinds of messy, real-world sensor and instruction corruptions that NavTrust now exposes.
Unlock human-like spatial reasoning in VLMs with VLM-3R, which reconstructs 3D understanding from monocular video using instruction tuning, bypassing the need for external depth sensors.