Search papers, labs, and topics across Lattice.
2
0
4
0
Diffusion models, guided by vision-language models and geometric constraints, can now reconstruct realistic 3D human interactions from monocular video, even with severe occlusions.
Achieve realistic and responsive avatar interactions without expensive full-sequence inference, thanks to a novel modular framework that adapts pre-trained motion priors to diverse interaction scenarios.