9]. ***Code and trained models will be made publicly available upon acceptance.Mar 29, 2026arXiv:2603.27757

E-TIDE: Fast, Structure-Preserving Motion Forecasting from Event Sequences

Biswadeep Sen, Benoit R. Cottereau, Nicolas Cuperlier, Terence Sim

AI Summary

The paper introduces E-TIDE, a novel and lightweight architecture for predicting future event representations from event-based camera data. E-TIDE leverages the TIDE module, which employs large-kernel mixing and activity-aware gating for efficient spatiotemporal interaction on sparse event tensors. Experiments on standard datasets show that E-TIDE achieves competitive performance with significantly reduced model size and training requirements compared to existing state-of-the-art approaches.

Key Contribution

Event-based vision gets a lightweight, efficient boost: E-TIDE matches state-of-the-art forecasting accuracy while slashing model size and training costs.

Abstract

Event-based cameras capture visual information as asynchronous streams of per-pixel brightness changes, generating sparse, temporally precise data. Compared to conventional frame-based sensors, they offer significant advantages in capturing high-speed dynamics while consuming substantially less power. Predicting future event representations from past observations is an important problem, enabling downstream tasks such as future semantic segmentation or object tracking without requiring access to future sensor measurements. While recent state-of-the-art approaches achieve strong performance, they often rely on computationally heavy backbones and, in some cases, large-scale pretraining, limiting their applicability in resource-constrained scenarios. In this work, we introduce E-TIDE, a lightweight, end-to-end trainable architecture for event-tensor prediction that is designed to operate efficiently without large-scale pretraining. Our approach employs the TIDE module (Temporal Interaction for Dynamic Events), motivated by efficient spatiotemporal interaction design for sparse event tensors, to capture temporal dependencies via large-kernel mixing and activity-aware gating while maintaining low computational complexity. Experiments on standard event-based datasets demonstrate that our method achieves competitive performance with significantly reduced model size and training requirements, making it well-suited for real-time deployment under tight latency and memory budgets.

Architecture Design (Transformers, SSMs, MoE)Computer Vision Robotics & Embodied AI

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

E-TIDE: Fast, Structure-Preserving Motion Forecasting from Event Sequences

Related Papers