Search papers, labs, and topics across Lattice.
The paper introduces FLIGHT, a novel Hough transform generalization on the unit sphere (S(2)) for robust camera heading estimation from monocular video, particularly in noisy and outlier-ridden conditions. FLIGHT discretizes the unit sphere using a Fibonacci lattice to accumulate votes from great circles derived from feature correspondences, thereby identifying the most consistent motion direction. Experiments across multiple datasets demonstrate FLIGHT's Pareto-optimal performance in accuracy versus efficiency, and its ability to improve SLAM initialization by correcting heading errors, reducing RMSE.
A Fibonacci lattice-based Hough transform on the unit sphere unlocks robust and efficient monocular camera heading estimation, outperforming existing methods in noisy, outlier-ridden scenarios.
Estimating camera motion from monocular video is a fundamental problem in computer vision, central to tasks such as SLAM, visual odometry, and structure-from-motion. Existing methods that recover the camera's heading under known rotation, whether from an IMU or an optimization algorithm, tend to perform well in low-noise, low-outlier conditions, but often decrease in accuracy or become computationally expensive as noise and outlier levels increase. To address these limitations, we propose a novel generalization of the Hough transform on the unit sphere (S(2)) to estimate the camera's heading. First, the method extracts correspondences between two frames and generates a great circle of directions compatible with each pair of correspondences. Then, by discretizing the unit sphere using a Fibonacci lattice as bin centers, each great circle casts votes for a range of directions, ensuring that features unaffected by noise or dynamic objects vote consistently for the correct motion direction. Experimental results on three datasets demonstrate that the proposed method is on the Pareto frontier of accuracy versus efficiency. Additionally, experiments on SLAM show that the proposed method reduces RMSE by correcting the heading during camera pose initialization.