Search papers, labs, and topics across Lattice.
DuoMo, a novel generative method, addresses the challenge of reconstructing human motion in world-space coordinates from unconstrained videos by factorizing motion learning into two diffusion models: a camera-space model for initial estimation and a world-space model for refinement and global consistency. This factorization allows the method to generalize from noisy video inputs while maintaining global motion consistency, bypassing parametric models by directly generating the motion of mesh vertices. Experiments on EMDB and RICH datasets demonstrate state-of-the-art performance, achieving significant reductions in world-space reconstruction error and foot skating.
By disentangling camera-space estimation from world-space refinement via dual diffusion models, DuoMo achieves state-of-the-art human motion reconstruction from noisy video, bypassing the limitations of parametric models.
We present DuoMo, a generative method that recovers human motion in world-space coordinates from unconstrained videos with noisy or incomplete observations. Reconstructing such motion requires solving a fundamental trade-off: generalizing from diverse and noisy video inputs while maintaining global motion consistency. Our approach addresses this problem by factorizing motion learning into two diffusion models. The camera-space model first estimates motion from videos in camera coordinates. The world-space model then lifts this initial estimate into world coordinates and refines it to be globally consistent. Together, the two models can reconstruct motion across diverse scenes and trajectories, even from highly noisy or incomplete observations. Moreover, our formulation is general, generating the motion of mesh vertices directly and bypassing parametric models. DuoMo achieves state-of-the-art performance. On EMDB, our method obtains a 16% reduction in world-space reconstruction error while maintaining low foot skating. On RICH, it obtains a 30% reduction in world-space error. Project page: https://yufu-wang.github.io/duomo/