HKUJun 1, 2026arXiv:2606.02268

From Extrinsic to Intrinsic: Geodesic-Guided Representation Learning for 3D Geometric Data

Yuming Zhao, Junhui Hou, Qijian Zhang, Jia Qin, Ying He

AI Summary

This paper introduces PRISM, a novel 3D representation learning paradigm that shifts focus from extrinsic spatial structures to intrinsic surface geodesic metrics, enabling the learning of isometric embeddings. By incorporating a topology-enforcing objective and a two-stage training approach to address sample imbalance in geodesic distances, PRISM effectively captures the essence of shape identity and manifold topology. Experimental results reveal that PRISM not only excels in geodesic distance prediction but also outperforms existing methods in various downstream tasks such as shape recognition and surface parameterization.

Key Contribution

PRISM achieves superior performance in 3D representation learning by leveraging intrinsic geometric properties, outperforming traditional methods that rely on extrinsic features.

Abstract

Geometric analysis fundamentally distinguishes between \textit{extrinsic} and \textit{intrinsic} perspectives. The dominant paradigm in current 3D representation learning relies on either extrinsic spatial structures or high-level semantics, struggling to capture the essence of shape identity and underlying manifold topology. To bridge this gap, we introduce a novel 3D representation learning paradigm, namely \textbf{PRISM}, for \textbf{P}re-training, which learns isometric embeddings by \textbf{R}ecovering the \textbf{I}ntrinsic \textbf{S}urface geodesic \textbf{M}etric. PRISM incorporates a topology-enforcing objective that explicitly constrains the structure of latent space, alongside a specialized two-stage training recipe mitigating sample imbalance inherent in the distribution of geodesic distances. Experiments demonstrate that our approach shows satisfactory accuracy, robustness, and high efficiency in geodesic distance prediction and achieves superior performance across diverse downstream tasks, including shape recognition, surface parameterization, and non-rigid correspondence. The code will be publicly available at https://github.com/AidenZhao/PRISM.

Computer Vision

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

From Extrinsic to Intrinsic: Geodesic-Guided Representation Learning for 3D Geometric Data

Related Papers