This paper introduces SHARP, a streaming-based motion forecasting framework designed to handle heterogeneous observation lengths by explicitly modeling evolving scenes. SHARP uses instance-aware context streaming to update latent agent representations incrementally and employs a dual training objective to keep accuracy consistent across different observation horizons. Experiments on Argoverse 2, nuScenes, and Argoverse 1 show that SHARP is robust under evolving scene conditions and achieves state-of-the-art streaming performance on the Argoverse 2 multi-agent benchmark with minimal latency, making it suitable for real-world deployment.
Streaming motion forecasting can handle variable observation lengths without sacrificing accuracy, which makes it more robust in real-world deployment.
In dynamic traffic environments, motion forecasting models must continuously and accurately estimate future trajectories. Streaming-based methods are a promising solution, but despite recent advances, their performance often degrades when exposed to heterogeneous observation lengths. To address this, we propose a novel streaming-based motion forecasting framework that explicitly focuses on evolving scenes. Our method incrementally processes incoming observation windows and leverages instance-aware context streaming to maintain and update latent agent representations across inference steps. A dual training objective further enables consistent forecasting accuracy across diverse observation horizons. Extensive experiments on Argoverse 2, nuScenes, and Argoverse 1 demonstrate the robustness of our approach under evolving scene conditions as well as on single-agent benchmarks. Our model achieves state-of-the-art performance in streaming inference on the Argoverse 2 multi-agent benchmark while maintaining minimal latency, highlighting its suitability for real-world deployment.
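To make the idea of carrying per-agent state across inference steps concrete, the following is a minimal sketch and not SHARP's architecture. It assumes a hand-written constant-velocity "encoder" in place of the learned one, and every name in it (`StreamingContext`, `encode_window`, `momentum`) is hypothetical. It only illustrates how a state keyed by agent instance can absorb observation windows of different lengths; the dual training objective is not covered.

```python
from dataclasses import dataclass, field

Point = tuple[float, float]


def encode_window(points: list[Point]) -> list[float]:
    """Toy stand-in for a learned encoder: last position and mean velocity."""
    last_x, last_y = points[-1]
    if len(points) < 2:
        # A single observation carries no motion information.
        return [last_x, last_y, 0.0, 0.0]
    steps = len(points) - 1
    vx = (points[-1][0] - points[0][0]) / steps
    vy = (points[-1][1] - points[0][1]) / steps
    return [last_x, last_y, vx, vy]


@dataclass
class StreamingContext:
    """Keeps one latent vector [x, y, vx, vy] per agent ID across steps."""
    momentum: float = 0.5  # hypothetical weight given to the old velocity
    latents: dict[str, list[float]] = field(default_factory=dict)

    def update(self, window: dict[str, list[Point]]) -> None:
        """Fold a new observation window (any length per agent) into the state."""
        for agent_id, points in window.items():
            if not points:
                continue
            new = encode_window(points)
            old = self.latents.get(agent_id)
            if old is not None:
                # Position is replaced; velocity is blended with the history.
                m = self.momentum
                new[2] = m * old[2] + (1 - m) * new[2]
                new[3] = m * old[3] + (1 - m) * new[3]
            self.latents[agent_id] = new
        # Agents absent from the current window are treated as having left.
        for agent_id in list(self.latents):
            if agent_id not in window:
                del self.latents[agent_id]

    def forecast(self, agent_id: str, horizon: int) -> list[Point]:
        """Constant-velocity rollout from the stored latent."""
        x, y, vx, vy = self.latents[agent_id]
        return [(x + vx * t, y + vy * t) for t in range(1, horizon + 1)]


ctx = StreamingContext()
ctx.update({"a": [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0)], "b": [(5.0, 5.0)]})
ctx.update({"a": [(3.0, 0.0), (5.0, 0.0)]})  # shorter window; "b" has left
print(ctx.forecast("a", horizon=2))
```

In this toy version, each call to `update` touches only the new window, so the cost per inference step does not grow with the total history length, which is the property that makes streaming attractive for low-latency use.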