Search papers, labs, and topics across Lattice.
4
0
6
11
Achieve SOTA in long-horizon spatial understanding by training a model to streamingly maintain and update spatial evidence from video via test-time adaptation of fast weights.
Forget hand-annotated 3D datasets: a new automated pipeline generates massive, high-quality 3D spatial intelligence from raw video, unlocking better VLM reasoning.
Ditch the linear CFG gains: Sliding Mode Control offers provably stable and semantically richer diffusion guidance, especially when you crank up the guidance scale.
Achieve more realistic and physically plausible scene reconstructions from video by explicitly optimizing viewpoints for object generation and synthesizing scene graphs within a 3D simulator.