Search papers, labs, and topics across Lattice.
The paper introduces Sparkle, a novel point cloud representation for human motion capture that unifies skeletal joints and surface anchors with kinematic-geometric factorization. SparkleMotion, the framework built upon Sparkle, learns this representation using hierarchical modules that embed geometric continuity and kinematic constraints. Experiments show that SparkleMotion achieves state-of-the-art performance in accuracy, robustness, and generalization, especially under domain shifts, noise, and occlusion, outperforming existing point-based and skeleton-based methods.
A new point cloud representation, Sparkle, achieves state-of-the-art human motion capture by explicitly disentangling kinematic structure from surface geometry, finally balancing expressiveness and robustness.
Point cloud-based motion capture leverages rich spatial geometry and privacy-preserving sensing, but learning robust representations from noisy, unstructured point clouds remains challenging. Existing approaches face a struggle trade-off between point-based methods (geometrically detailed but noisy) and skeleton-based ones (robust but oversimplified). We address the fundamental challenge: how to construct an effective representation for human motion capture that can balance expressiveness and robustness. In this paper, we propose Sparkle, a structured representation unifying skeletal joints and surface anchors with explicit kinematic-geometric factorization. Our framework, SparkleMotion, learns this representation through hierarchical modules embedding geometric continuity and kinematic constraints. By explicitly disentangling internal kinematic structure from external surface geometry, SparkleMotion achieves state-of-the-art performance not only in accuracy but crucially in robustness and generalization under severe domain shifts, noise, and occlusion. Extensive experiments demonstrate our superiority across diverse sensor types and challenging real-world scenarios.