Search papers, labs, and topics across Lattice.
3
1
6
7
One model to control them all: Qwen-VLA achieves impressive zero-shot generalization across diverse robotic tasks and embodiments by unifying vision-language-action modeling.
Generative video models can now simulate a continuously evolving world, even when objects are out of sight, thanks to a new framework that maintains persistent global state.
VLNVerse tackles the sim-to-real gap in vision-language navigation by providing a unified, large-scale benchmark with realistic physics simulation and full-kinematics embodied agents.