YonseiApr 30, 2026arXiv:2604.27504

REVIVE 3D: Refinement via Encoded Voluminous Inflated prior for Volume Enhancement

H. Lee, Hankyeol Lee, Wooyeol Baek, Seongdok Kim, Seongdo Kim, Jongyoo Kim

AI Summary

REVIVE 3D is introduced as a two-stage pipeline to generate voluminous 3D assets from flat 2D images, addressing the limitations of existing generative models. The pipeline first constructs an Inflated Prior by inflating the foreground silhouette and adding part-aware details, then refines this prior in a latent space using a denoising process guided by geometric cues. Experiments demonstrate state-of-the-art performance on flat image datasets, validated by new metrics (Compactness and Normal Anisotropy) that correlate with human perception of volume and quality.

Key Contribution

Flat 2D images can now be turned into voluminous 3D assets with state-of-the-art fidelity, thanks to a clever inflated-prior and latent-refinement pipeline.

Abstract

Recent generative models have shown strong performance in generating diverse 3D assets from 2D images, a fundamental research topic in computer vision and graphics. However, these models still struggle to generate voluminous 3D assets when the input is a flat image that provides limited 3D cues. We introduce REVIVE 3D, a two-stage, plug-and-play pipeline for generating voluminous 3D assets from flat images. In Stage 1, we construct an Inflated Prior by inflating the foreground silhouette to recover global volume and superimposing part-aware details to capture local structure. In Stage 2, 3D Latent Refinement injects Gaussian noise into the Inflated Prior's latent and then denoises it, using the prior's geometric cues to leverage the backbone's pretrained 3D knowledge. Furthermore, our framework supports image-conditioned 3D editing. To quantify volume and surface flatness, we propose Compactness and Normal Anisotropy. We validate Compactness and Normal Anisotropy through a user study, showing that these metrics align with human perception of volume and quality. We show that REVIVE 3D achieves state-of-the-art performance on a challenging flat image dataset, based on extensive qualitative and quantitative evaluations.

Computer Vision Multimodal Models

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

REVIVE 3D: Refinement via Encoded Voluminous Inflated prior for Volume Enhancement

Related Papers