K training images from 7 scenes and 2.National Key Laboratory of Deep Space ExplorationReconstruct CompleteUSTCMay 25, 2026arXiv:2605.25553

ComPose: A Unified Completion-Pose Framework for Robust Category-Level Object Pose Estimation

Huan Ren, Yihan Chen, Chuxin Wang, Nailong Liu, Wenfei Yang, Tianzhu Zhang

AI Summary

The paper introduces ComPose, a unified framework for category-level object pose estimation that tightly integrates shape completion with pose reasoning. ComPose uses a keypoint-based progressive completion module to recover full shape representations from partial point clouds, enabling more robust pose estimation. A geometric relation consistency loss ensures structural alignment between observed keypoints and predicted normalized object coordinate space (NOCS) coordinates, leading to state-of-the-art performance on standard benchmarks.

Key Contribution

Achieve state-of-the-art category-level object pose estimation by unifying shape completion and pose reasoning, without relying on category-level shape priors.

Abstract

Category-level object pose estimation aims to predict the pose and size of arbitrary objects in specific categories. Existing methods struggle with the inherent incompleteness of observed point clouds, which limits their ability to capture complete object shapes for robust pose reasoning. While point cloud completion offers a promising solution, naively treating it as a separate preprocessing step for partial observations introduces compounding errors and additional computational overhead, ultimately hindering both accuracy and efficiency. To address these challenges, we propose ComPose, a novel unified framework that tightly integrates shape completion to provide complete geometric cues for enhanced pose estimation. At the core of ComPose is a keypoint-based progressive completion module, which recovers full shape representations by progressively predicting a sparse set of keypoints and their surrounding dense point sets, empowering the keypoints to capture holistic object geometries. A geometric relation encoding module further enriches keypoint features with both local and global geometric context. In addition, we introduce a novel geometric relation consistency loss to enforce structural alignment between observed keypoints and their predicted NOCS coordinates, ensuring globally coherent coordinate transformations. Extensive experiments on standard benchmarks demonstrate that our method outperforms state-of-the-art approaches without relying on category-level shape priors.

Computer Vision Robotics & Embodied AI

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

ComPose: A Unified Completion-Pose Framework for Robust Category-Level Object Pose Estimation

Related Papers