Search papers, labs, and topics across Lattice.
2
0
4
0
Robot video world models can be significantly improved by distilling a multimodal reward function and stabilizing long-horizon inference, leading to better instruction following and manipulation accuracy.
Robots get a spatial-temporal reasoning boost with STARRY, a world model that aligns future predictions with action generation, leading to a significant jump in manipulation success.