Search papers, labs, and topics across Lattice.
This paper introduces MU-GeNeRF, a Multi-view Uncertainty-guided framework that enhances Generalizable Neural Radiance Fields (GeNeRFs) by effectively managing transient distractors that disrupt scene reconstruction. By decomposing distractor awareness into Source-view and Target-view Uncertainty, the method allows for adaptive modulation of supervision, leading to improved robustness in geometric modeling. Experimental results demonstrate that MU-GeNeRF outperforms existing GeNeRF approaches and matches the performance of specialized distractor-free NeRFs, highlighting its effectiveness in real-world applications.
Transient distractions can severely degrade scene reconstruction, but MU-GeNeRF effectively mitigates their impact, achieving results on par with specialized methods.
Generalizable Neural Radiance Fields (GeNeRFs) enable high-quality scene reconstruction from sparse views and can generalize to unseen scenes. However, in real-world settings, transient distractors break cross-view structural consistency, corrupting supervision and degrading reconstruction quality. Existing distractor-free NeRF methods rely on per-scene optimization and estimate uncertainty from per-view reconstruction errors, which are not reliable for GeNeRFs and often misjudge inconsistent static structures as distractors. To this end, we propose MU-GeNeRF, a Multi-view Uncertainty-guided distractor-aware GeNeRF framework designed to alleviate GeNeRF's robust modeling challenges in the presence of transient distractions. We decompose distractor awareness into two complementary uncertainty components: Source-view Uncertainty, which captures structural discrepancies across source views caused by viewpoint changes or dynamic factors; and Target-view Uncertainty, which detects observation anomalies in the target image induced by transient distractors.These two uncertainties address distinct error sources and are combined through a heteroscedastic reconstruction loss, which guides the model to adaptively modulate supervision, enabling more robust distractor suppression and geometric modeling.Extensive experiments show that our method not only surpasses existing GeNeRFs but also achieves performance comparable to scene-specific distractor-free NeRFs.