This paper introduces a structure-aware densification framework for 3D Gaussian Splatting that accelerates convergence and improves reconstruction quality. The method uses multi-scale frequency analysis, combining structure tensors and Laplacian scale space analysis, to estimate the dominant frequency at each pixel and guide anisotropic Gaussian splitting based on a novel frequency violation metric. Experiments on standard benchmarks demonstrate faster convergence and superior reconstruction quality, especially in high-frequency regions, compared to existing methods.
Stop blurring the details: structure-aware Gaussian Splatting densification uses frequency analysis to resolve high-frequency textures faster and with higher quality.
3D Gaussian Splatting has emerged as a powerful scene representation for real-time novel-view synthesis. However, its standard adaptive density control relies on screen-space positional gradients, which do not distinguish between geometric misplacement and frequency aliasing. This often leads to either over-blurred high-frequency textures or inefficient over-densification. We present a structure-aware densification framework. Our key insight is that the decision to subdivide a Gaussian should be driven by an explicit comparison between its projected screen-space extent and the local structure of the texture it seeks to represent. We introduce a multi-scale frequency analysis that combines structure tensors with Laplacian scale-space analysis to estimate the dominant frequency at each pixel, enabling robust supervision across varying texture scales. Based on this analysis, we define $\eta$, a per-Gaussian, per-axis frequency violation metric that indicates when a primitive may be under-resolving local texture details. Unlike methods that split isotropically (e.g., splitting each Gaussian into two smaller ones with uniform shape), our approach splits anisotropically: for each axis with high $\eta$, we compute a split factor that better resolves the local frequency content. We further introduce a multiview consistency criterion that aggregates $\eta$ observations across multiple views. By densifying earlier and faster, we skip the lengthy iterative densification phases required by baseline methods and achieve significantly faster convergence. Experiments on standard benchmarks demonstrate that our method also achieves superior reconstruction quality, particularly in high-frequency regions.
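The per-axis split decision described in the abstract can be illustrated with a small sketch. The abstract does not give the formula for $\eta$, so everything specific below is an assumption made for illustration and not the authors' method: $\eta$ is taken to be the Gaussian's projected footprint measured in local texture periods, the axes are treated as two screen-space axes, the multiview criterion is a median over views, and the names `frequency_violation`, `split_factor`, `aggregate_views`, `plan_split`, `eta_max`, and `max_split` are hypothetical.

```python
import math
from statistics import median


def frequency_violation(sigma_px, freq_cpp, extent_scale=2.0):
    """Assumed eta: footprint of the Gaussian measured in texture periods.

    sigma_px: projected standard deviation along one axis, in pixels.
    freq_cpp: dominant local frequency along that axis, in cycles per pixel.
    extent_scale: how many sigmas count as the footprint (assumption).
    """
    return extent_scale * sigma_px * freq_cpp


def split_factor(eta, eta_max=0.5, max_split=4):
    """Pieces needed along one axis so that each piece has eta <= eta_max.

    eta_max and max_split are illustrative values, not taken from the paper.
    """
    if eta <= eta_max:
        return 1
    return min(max_split, math.ceil(eta / eta_max))


def aggregate_views(etas, min_views=2):
    """Assumed multiview criterion: median eta, ignored if seen in too few views."""
    if len(etas) < min_views:
        return 0.0
    return median(etas)


def plan_split(sigmas_per_view, freqs_per_view):
    """Return one split factor per axis for a single Gaussian.

    sigmas_per_view: list of (sigma_u, sigma_v) tuples, one per view.
    freqs_per_view:  list of (f_u, f_v) tuples, one per view.
    """
    factors = []
    for axis in range(2):
        etas = [
            frequency_violation(sigma[axis], freq[axis])
            for sigma, freq in zip(sigmas_per_view, freqs_per_view)
        ]
        factors.append(split_factor(aggregate_views(etas)))
    return tuple(factors)


# A Gaussian that is wide along u and narrow along v, seen in three views
# over a texture with an 8-pixel period in both directions.
sigmas = [(4.0, 1.0), (5.0, 1.0), (4.5, 1.0)]
freqs = [(0.125, 0.125)] * 3
print(plan_split(sigmas, freqs))
```

With these inputs only the wide axis exceeds the assumed threshold, so the sketch splits the Gaussian into three pieces along u and leaves v untouched. That is the anisotropic behavior the abstract contrasts with uniform two-way splitting.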