The paper introduces EventTrack6D, an event-depth tracking framework for 6D object pose estimation that generalizes to novel objects without object-specific training. It leverages a dual reconstruction approach to recover dense photometric and geometric cues from sparse event streams, conditioned on depth measurements. The method achieves 6D pose tracking at over 120 FPS and generalizes effectively from synthetic training data to real-world scenarios, as validated by a newly introduced benchmark suite.
Event cameras unlock 6D pose tracking of novel objects at 120+ FPS, even with rapid motion, by fusing sparse event streams with depth in a way that generalizes zero-shot from synthetic training.
Event cameras provide microsecond latency, making them suitable for 6D object pose tracking in fast, dynamic scenes where conventional RGB and depth pipelines suffer from motion blur and large pixel displacements. We introduce EventTrack6D, an event-depth tracking framework that generalizes to novel objects without object-specific training by reconstructing both intensity and depth at arbitrary timestamps between depth frames. Conditioned on the most recent depth measurement, our dual reconstruction recovers dense photometric and geometric cues from sparse event streams. EventTrack6D operates at over 120 FPS and maintains temporal consistency under rapid motion. To support training and evaluation, we introduce a comprehensive benchmark suite: a large-scale synthetic dataset for training and two complementary evaluation sets, including real and simulated event datasets. Trained exclusively on synthetic data, EventTrack6D generalizes effectively to real-world scenarios without fine-tuning, maintaining accurate tracking across diverse objects and motion patterns. Our method and datasets validate the effectiveness of event cameras for 6D pose tracking of novel objects. Code and datasets are publicly available at https://chohoonhee.github.io/Event6D.
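To make the idea of querying "arbitrary timestamps between depth frames" concrete, the sketch below accumulates the events that arrive after the most recent depth frame and up to a query time into a signed per-pixel count image. This is a common, generic event representation and not the paper's reconstruction network; the `Event` tuple layout, the function name `accumulate_events`, and the parameters `t_depth` and `t_query` are assumptions made only for illustration.

```python
from typing import List, Tuple

# Hypothetical event layout: (timestamp in seconds, x, y, polarity in {-1, +1}).
Event = Tuple[float, int, int, int]


def accumulate_events(
    events: List[Event],
    t_depth: float,
    t_query: float,
    width: int,
    height: int,
) -> List[List[int]]:
    """Sum signed polarities per pixel for events with t_depth < t <= t_query."""
    frame = [[0] * width for _ in range(height)]
    for t, x, y, polarity in events:
        in_window = t_depth < t <= t_query
        in_bounds = 0 <= x < width and 0 <= y < height
        if in_window and in_bounds:
            frame[y][x] += polarity
    return frame


# Toy stream: three events inside the window, one after the query time.
events = [
    (0.001, 0, 0, +1),
    (0.002, 1, 0, -1),
    (0.003, 0, 0, +1),
    (0.009, 1, 1, +1),  # later than t_query, so it is ignored
]
frame = accumulate_events(events, t_depth=0.0, t_query=0.005, width=2, height=2)
```

In a pipeline of the kind the abstract describes, an image like `frame` would be one possible input, alongside the latest depth map, to a learned model that reconstructs dense intensity and depth at `t_query`; that model is not sketched here.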