Search papers, labs, and topics across Lattice.
2
0
4
2
By "dreaming ahead" with learned latent visual dynamics, LatentPilot achieves state-of-the-art vision-and-language navigation, demonstrating the power of future-aware reasoning without needing future observations at test time.
Unlock the power of web videos for embodied AI: implicit geometry representations let agents learn to navigate from real-world room tours without relying on fragile 3D reconstruction.