Forget finetuning video models for each robot: a single, action-free video world model can drive diverse robots when paired with a carefully designed inverse dynamics model.
Encoder-decoder architectures can beat decoder-only transformers in novel view synthesis, overturning conventional wisdom with a compute-optimal design (SVSM) that slashes training costs.