The paper introduces B$^3$-Seg, a method for open-vocabulary 3D Gaussian Splatting (3DGS) segmentation that requires no predefined camera viewpoints, training, or ground-truth labels. It formulates segmentation as a sequence of Beta-Bernoulli Bayesian updates and uses an analytic Expected Information Gain (EIG) to actively select the next view. The method achieves segmentation performance competitive with supervised approaches while completing in a few seconds, which makes it practical for interactive 3DGS editing.
Ditch the training data and predefined camera views: B$^3$-Seg segments 3D Gaussian Splatting scenes in seconds using sequential Beta-Bernoulli Bayesian updates.
Interactive 3D Gaussian Splatting (3DGS) segmentation is essential for real-time editing of pre-reconstructed assets in film and game production. However, existing methods rely on predefined camera viewpoints, ground-truth labels, or costly retraining, making them impractical for low-latency use. We propose B$^3$-Seg (Beta-Bernoulli Bayesian Segmentation for 3DGS), a fast and theoretically grounded method for open-vocabulary 3DGS segmentation under camera-free and training-free conditions. Our approach reformulates segmentation as sequential Beta-Bernoulli Bayesian updates and actively selects the next view via analytic Expected Information Gain (EIG). This Bayesian formulation guarantees the adaptive monotonicity and submodularity of EIG, so greedy view selection achieves a $(1{-}1/e)$ approximation to the optimal view sampling policy. Experiments on multiple datasets show that B$^3$-Seg achieves results competitive with high-cost supervised methods while completing end-to-end segmentation within a few seconds. The results demonstrate that B$^3$-Seg enables practical, interactive 3DGS segmentation with provable information efficiency.
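To make the idea of sequential Beta-Bernoulli updates with EIG-driven view selection concrete, here is a minimal sketch in Python. It is not the paper's implementation: it assumes each Gaussian carries an independent Beta(a, b) posterior over its foreground probability, that a view's score is the sum of the per-Gaussian EIG values it can see, and that the EIG is the standard mutual information between one Bernoulli observation and the Beta-distributed parameter. The names `beta_update`, `expected_information_gain`, `pick_next_view`, and the `visibility` mapping are hypothetical and chosen only for illustration.

```python
import math


def digamma(x: float) -> float:
    """Digamma function for x > 0 (recurrence plus asymptotic series)."""
    result = 0.0
    while x < 6.0:  # shift x upward so the asymptotic series is accurate
        result -= 1.0 / x
        x += 1.0
    inv2 = 1.0 / (x * x)
    series = inv2 * (1.0 / 12.0 - inv2 * (1.0 / 120.0 - inv2 / 252.0))
    return result + math.log(x) - 0.5 / x - series


def beta_update(a: float, b: float, y: float) -> tuple[float, float]:
    """Conjugate Beta-Bernoulli update.

    y is the observed evidence in [0, 1]; allowing fractional (soft mask)
    values is an assumption of this sketch.
    """
    return a + y, b + (1.0 - y)


def expected_information_gain(a: float, b: float) -> float:
    """Mutual information (in nats) between one Bernoulli draw and theta ~ Beta(a, b).

    EIG = H[y] - E_theta[H[y | theta]]; for a uniform Beta(1, 1) prior
    this equals ln 2 - 1/2.
    """
    p = a / (a + b)
    marginal_entropy = -(p * math.log(p) + (1.0 - p) * math.log(1.0 - p))
    total = digamma(a + b + 1.0)
    expected_conditional_entropy = -(
        p * (digamma(a + 1.0) - total) + (1.0 - p) * (digamma(b + 1.0) - total)
    )
    return marginal_entropy - expected_conditional_entropy


def pick_next_view(posteriors, visibility):
    """Greedy step: choose the candidate view with the largest summed EIG.

    posteriors: list of (a, b) pairs, one per Gaussian.
    visibility: dict mapping a view name to the indices of Gaussians it sees
                (a hypothetical stand-in for rendering the scene).
    """
    def score(view):
        return sum(expected_information_gain(*posteriors[i]) for i in visibility[view])

    return max(visibility, key=score)


if __name__ == "__main__":
    posteriors = [(1.0, 1.0), (10.0, 10.0), (50.0, 2.0)]
    visibility = {"front": [0, 1], "back": [1, 2]}
    view = pick_next_view(posteriors, visibility)
    print("next view:", view)
    # Fold a soft foreground observation into every Gaussian that view sees.
    for i in visibility[view]:
        posteriors[i] = beta_update(*posteriors[i], y=0.9)
    print("updated posteriors:", posteriors)
```

In this toy setup the greedy step favors the view that covers the least-observed Gaussian, since the EIG of a Beta posterior shrinks as its pseudo-counts grow. That diminishing-returns behavior is the intuition behind the submodularity property the abstract relies on for the greedy $(1{-}1/e)$ guarantee.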