This paper introduces GlueMap, a Structure-from-Motion (SfM) pipeline that combines classical SfM techniques with recent feedforward 3D reconstruction methods to overcome the limitations each has in isolation. GlueMap uses feedforward methods to handle challenging scenarios such as low texture, limited overlap, and symmetries, while retaining the scalability and accuracy of classical SfM in standard settings. Experiments across multiple datasets show state-of-the-art reconstruction performance across a wide range of scenarios.
Classical SfM tends to fail on low-texture, low-overlap, or symmetric scenes, and feedforward reconstruction often falls short on scalability, accuracy, or robustness; combining them yields a system that is both robust and accurate.
Structure-from-Motion -- the process of simultaneously estimating camera poses and 3D scene structure from a collection of images -- remains a central challenge in computer vision, with many open problems yet to be solved. Recent advances in feedforward 3D reconstruction have made significant strides in overcoming persistent failure cases of classical SfM methods, particularly in scenarios characterized by low texture, limited overlap, and symmetries. However, while feedforward approaches excel in these challenging conditions, they often face limitations regarding scalability, accuracy, or robustness, and typically fall short of classical methods in standard reconstruction settings. In this work, we systematically analyze these limitations and propose a new Structure-from-Motion pipeline by combining the respective strengths of classical and feedforward methods. Extensive experiments across multiple datasets show the benefits of our approach, achieving state-of-the-art results across a wide range of scenarios. We share our system as an open-source implementation at https://github.com/colmap/gluemap.