NUSMBZUAIJun 1, 2026arXiv:2606.01545

Hierarchical Object Representation for Spatial Robot Perception: Points, Meshes, and Superquadrics

Ceng Zhang, Wan Su, Mohamed Samshad, Gregory S. Chirikjian, Rajat Talak

AI Summary

This paper introduces a hierarchical object representation for 3D Scene Graphs (3DSG) that enhances spatial robot perception by integrating metric, semantic, and topological information. The proposed method utilizes a four-layer structure that transitions from raw sensor data to dense 3D meshes and analytical primitives like superquadrics, enabling high-fidelity object reconstruction and robust navigation in complex environments. Experimental results demonstrate that the superquadric-based map alignment significantly outperforms existing methods, showcasing its effectiveness in real-world scenarios across various datasets.

Key Contribution

Superquadric representations enable robots to achieve unprecedented accuracy in object-level reconstruction and navigation, outperforming state-of-the-art methods in cluttered environments.

Abstract

Hierarchical 3D Scene Graphs (3DSG) have emerged as an actionable and scalable representation for long-term autonomy incorporating metric, semantic, and topological information in the scene. However, the question of geometric representation of objects in 3DSG has been overlooked as most methods use simplified geometric models such as partial point clouds or 3D bounding boxes. In this work, we introduce a hierarchical object representation that can be leveraged for high-fidelity object-level reconstruction, object-based robust re-localization or map alignment, and efficient and analytical collision checking for safe robot navigation planning in dense and cluttered environments. The representation is structurally organized into four distinct layers, progressively abstracting the scene from raw sensor data to dense 3D meshes to analytical primitives such as superquadrics, which provide a sparse and analytical representation for object geometry. We develop a pipeline that builds the hierarchical object representation from RGB-D image stream captured by a robot, and demonstrate its working in real-world open-set object scenes in both indoor and outdoor environments. Extensive experiments across diverse datasets including HOPE, ReplicaCAD, Kimera-Multi, and NUS Campus Dataset collected using Unitree B2 Robot validate our pipeline in both indoor and outdoor environments. We show that our superquadric-based map alignment method outperforms the current state-of-the-art object based map alignment method ROMAN. Our code can be found at https://github.com/perceptica-robotics/Hickory.

Computer Vision Robotics & Embodied AI

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Hierarchical Object Representation for Spatial Robot Perception: Points, Meshes, and Superquadrics

Related Papers