D observationsKAISTM utterances from 6May 25, 2026arXiv:2605.26230

Geometry-Aware Representation Denoising for Robust Multi-view 3D Reconstruction

Jin Hyeon Kim, Jaeeun Lee, Claire Kim, Kyoungjin Oh, Paul Hyunbin Cho, Jaewon Min, Yeji Choi, Jihye Park, Hyunhee Park, Minkyu Park, Seungryong Kim

AI Summary

This paper introduces Geometry-Aware Representation Denoising (GARD), a diffusion-based framework for robust multi-view 3D reconstruction that operates directly in the feature space of a feed-forward 3D reconstruction model. GARD leverages geometry-aware feature representations to denoise and refine features, enabling simultaneous recovery of accurate 3D scene geometry and high-quality RGB images. Experiments on the DA3 benchmark demonstrate GARD's effectiveness in handling degraded imaging conditions.

Key Contribution

Denoising directly in a 3D reconstructor's feature space unlocks simultaneous recovery of scene geometry and high-quality imagery from degraded multi-view inputs.

Abstract

Multi-view 3D reconstruction has achieved remarkable progress with the advent of feed-forward 3D reconstruction models. However, these models are typically trained and evaluated under ideal, degradation-free imaging conditions, whereas real-world observations often contain degradations that differ significantly from such settings. Improving robustness for multi-view 3D reconstruction under degraded conditions therefore remains an important challenge. We present Geometry-Aware Representation Denoising (GARD), a novel framework that performs diffusion-based multi-view restoration directly in the feature space of a feed-forward 3D reconstruction model. This design exploits the geometry-aware feature representations of the 3D reconstructor to effectively recover accurate scene geometry. Furthermore, by employing an additional RGB image decoder, the refined representations can also be used to restore high-quality RGB images, thereby enabling the simultaneous recovery of 3D scene geometry and high-quality imagery. Comprehensive experiments on the Depth Anything 3 (DA3) benchmark demonstrate the effectiveness of the proposed GARD framework.

Computer Vision

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Geometry-Aware Representation Denoising for Robust Multi-view 3D Reconstruction

Related Papers