Tencent AI | Mar 2, 2026 | arXiv:2603.01839

LEAR: Learning Edge-Aware Representations for Event-to-LiDAR Localization

Kuangyi Chen, Jun Zhang, Yuxi Hu, Yi Zhou, Friedrich Fraundorfer

AI Summary

The paper introduces LEAR, a dual-task learning framework that jointly estimates edge structures and dense event-depth flow fields to address the challenge of aligning sparse, asynchronous event data with dense LiDAR maps for localization. By coupling edge and flow estimation through cross-modal fusion and iterative refinement, LEAR injects modality-invariant geometric cues into the motion representation and enforces mutual consistency between the two tasks. Experiments on challenging datasets demonstrate that LEAR achieves superior pose recovery performance compared to existing methods by producing edge-aware, depth-aligned flow fields suitable for Perspective-n-Point (PnP) solvers.

Key Contribution

By jointly learning edge structures and event-depth flow, LEAR makes event-camera localization against LiDAR maps more robust, even when direct correspondence estimation between the two modalities is unreliable.

Abstract

Event cameras offer high-temporal-resolution sensing that remains reliable under high-speed motion and challenging lighting, making them promising for localization from LiDAR point clouds in GPS-denied and visually degraded environments. However, aligning sparse, asynchronous events with dense LiDAR maps is fundamentally ill-posed, as direct correspondence estimation suffers from modality gaps. We propose LEAR, a dual-task learning framework that jointly estimates edge structures and dense event-depth flow fields to bridge the sensing-modality divide. Instead of treating edges as a post-hoc aid, LEAR couples them with flow estimation through a cross-modal fusion mechanism that injects modality-invariant geometric cues into the motion representation, and an iterative refinement strategy that enforces mutual consistency between the two tasks over multiple update steps. This synergy produces edge-aware, depth-aligned flow fields that enable more robust and accurate pose recovery via Perspective-n-Point (PnP) solvers. On several popular and challenging datasets, LEAR achieves superior performance over the best prior method. The source code, trained models, and demo videos are made publicly available online.
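The last step described in the abstract, turning a depth-aligned flow field into input for a PnP solver, can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a pinhole camera, a depth image rendered from the LiDAR map at an initial pose guess, and a flow field that maps each depth pixel to its location in the event frame; the function name `build_correspondences`, the intrinsics `fx, fy, cx, cy`, and the flow convention are all hypothetical choices made for illustration.

```python
from typing import List, Optional, Tuple

Point2D = Tuple[float, float]
Point3D = Tuple[float, float, float]


def build_correspondences(
    depth: List[List[Optional[float]]],
    flow: List[List[Tuple[float, float]]],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Tuple[Point3D, Point2D]]:
    """Pair each valid depth pixel's 3D point with its flow-displaced 2D pixel.

    Assumed convention (not taken from the paper): flow[v][u] = (du, dv) moves
    depth pixel (u, v) to its matching location in the event frame.
    """
    pairs: List[Tuple[Point3D, Point2D]] = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z is None or z <= 0.0:
                continue  # no LiDAR point projected onto this pixel
            # Back-project pixel (u, v) at depth z through the pinhole model.
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            du, dv = flow[v][u]
            # 3D point in the initial camera frame, 2D point in the event frame.
            pairs.append(((x, y, z), (u + du, v + dv)))
    return pairs


if __name__ == "__main__":
    depth = [[None, 2.0], [0.0, 4.0]]
    flow = [[(0.0, 0.0), (1.0, -0.5)], [(0.0, 0.0), (0.5, 0.5)]]
    for point_3d, point_2d in build_correspondences(depth, flow, 2.0, 2.0, 0.0, 0.0):
        print(point_3d, "->", point_2d)
```

The resulting 3D-2D pairs are the kind of input a PnP solver expects; in practice they could be passed to a robust solver such as OpenCV's `cv2.solvePnPRansac` to recover the pose correction relative to the initial guess.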

Computer Vision, Multimodal Models, Robotics & Embodied AI

Citation Metrics

Citations: 0

Influential citations: 0

References: 33

Year: 2026

Venue: N/A
