Search papers, labs, and topics across Lattice.
Point2Pose is introduced as a model-free method for 6D pose tracking of multiple rigid objects from monocular RGB-D video, initialized only from sparse image points. It leverages a 2D point tracker for long-range correspondences, enabling recovery after complete occlusion, and simultaneously reconstructs an online Truncated Signed Distance Function (TSDF) representation of the tracked targets. Experiments demonstrate performance comparable to state-of-the-art methods on a severe-occlusion benchmark, while supporting multi-object tracking and recovery from complete occlusion.
Track unseen objects through total occlusion without CAD models, using just a handful of 2D points.
We present Point2Pose, a model-free method for causal 6D pose tracking of multiple rigid objects from monocular RGB-D video. Initialized only from sparse image points on the objects to be tracked, our approach tracks multiple unseen objects without requiring object CAD models or category priors. Point2Pose leverages a 2D point tracker to obtain long-range correspondences, enabling instant recovery after complete occlusion. Simultaneously, the system incrementally reconstructs an online Truncated Signed Distance Function (TSDF) representation of the tracked targets. Alongside the method, we introduce a new multi-object tracking dataset comprising both simulation and real-world sequences, with motion-capture ground truth for evaluation. Experiments show that Point2Pose achieves performance comparable to the state-of-the-art methods on a severe-occlusion benchmark, while additionally supporting multi-object tracking and recovery from complete occlusion, capabilities that are not supported by previous model-free tracking approaches.