Unlock 36% better video depth estimation and 20% better camera pose estimation by letting your model learn from its own predictions on unlabeled video.
Achieve precise, coherent, and mask-free 3D editing from text prompts by having a multimodal LLM decompose the prompt into structural and appearance-level guidance for a rectified-flow inpainting pipeline.