This paper introduces SeeGroup, a novel method for estimating multilayer depth of transparent surfaces without relying on predefined groupings of 3D points. By formulating multilayer depth as a point process, SeeGroup allows for adaptive surface assignment, leading to a permutation-invariant likelihood that supports arbitrary layer configurations. Experimental results show a significant improvement in quadruplet relative depth accuracy on the LayeredDepth benchmark, increasing from 61.34% to 70.09%.
SeeGroup raises quadruplet relative depth accuracy on the LayeredDepth benchmark by 8.75 percentage points (from 61.34% to 70.09%, a relative gain of about 14%) by letting the model group surfaces into layers adaptively rather than relying on a fixed grouping strategy.
Transparent objects are common in daily life, and it is important to understand their multilayer depth, including the transparent surface and the objects behind it. Existing methods for multilayer depth typically extend single-layer prediction. They define layers by the front-to-back ordering of 3D points and predict the layers sequentially. However, because layered geometry can admit multiple valid groupings of 3D points into layers, a predefined grouping strategy is inherently restrictive. In this work, we propose SeeGroup, a multilayer depth estimation method that avoids imposing a predefined grouping and allows the model itself to adaptively assign surfaces to depth maps. We formulate per-pixel multilayer depth as a point process, treating depth layers as unordered events along each camera ray. This induces a permutation-invariant likelihood over the observed depth layers, yielding a loss that naturally supports arbitrary layer groupings. Experiments demonstrate that our method significantly advances the state of the art in multilayer depth estimation, improving quadruplet relative depth accuracy on the LayeredDepth benchmark from 61.34% to 70.09%. Code is available at https://github.com/princeton-vl/SeeGroup.
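To make the idea of a permutation-invariant loss concrete, here is a minimal sketch for a single camera ray. It is not SeeGroup's point-process likelihood, which the abstract does not spell out. It swaps in a simpler, well-known stand-in: the smallest mean absolute error over all orderings of the predicted depths. The function name `permutation_invariant_l1` and the sample depth values are hypothetical, chosen only for illustration.

```python
from itertools import permutations


def permutation_invariant_l1(pred_depths, target_depths):
    """Mean absolute error minimised over all orderings of the predictions.

    Illustrative stand-in only (an assumption, not the paper's loss):
    the value does not depend on the order in which the predicted
    layers along one ray are listed.
    """
    if len(pred_depths) != len(target_depths):
        raise ValueError("expected the same number of predicted and target layers")
    n = len(target_depths)
    if n == 0:
        return 0.0
    # Brute force over n! orderings; acceptable for the handful of layers on one ray.
    return min(
        sum(abs(p - t) for p, t in zip(perm, target_depths)) / n
        for perm in permutations(pred_depths)
    )


# Hypothetical depths (in metres) for three surfaces along one ray.
target = [1.1, 2.0, 3.0]
loss_a = permutation_invariant_l1([1.0, 3.0, 2.0], target)
loss_b = permutation_invariant_l1([3.0, 2.0, 1.0], target)
print(loss_a, loss_b)  # the two values agree: prediction order is irrelevant
```

A front-to-back ordered loss would penalise the two predictions above differently even though they describe the same set of surfaces. An order-free objective removes that dependence, which is the property the abstract attributes to the point-process formulation.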