Search papers, labs, and topics across Lattice.
2
0
4
5
Ditch the action tokens: representing robot actions as interpretable, pixel-grounded "action images" lets your video backbone act as a zero-shot policy.
LLMs can now generate complex, physically plausible 3D scenes for robotics simulation by iteratively proposing assets and refining arrangements based on physics engine feedback.