Mar 5, 2026arXiv:2603.04882

DeformTrace: A Deformable State Space Model with Relay Tokens for Temporal Forgery Localization

Xiaodong Zhu, Suting Wang, Yuanming Zheng, Junqi Yang, Yang Liao, Yangxu Liao, Yuhong Yang, Weiping Tu, Zhongyuan Wang

AI Summary

The paper introduces DeformTrace, a novel state space model (SSM) architecture for temporal forgery localization (TFL) in videos and audio. DeformTrace enhances SSMs with deformable dynamics via Deformable Self-SSM (DS-SSM) and a Relay Token Mechanism to improve temporal reasoning and mitigate long-range decay. Additionally, Deformable Cross-SSM (DC-SSM) reduces non-forgery information accumulation, leading to state-of-the-art TFL performance with improved efficiency and robustness.

Key Contribution

DeformTrace achieves state-of-the-art temporal forgery localization by equipping state space models with deformable dynamics and relay mechanisms, outperforming transformers with fewer parameters and faster inference.

Abstract

Temporal Forgery Localization (TFL) aims to precisely identify manipulated segments in video and audio, offering strong interpretability for security and forensics. While recent State Space Models (SSMs) show promise in precise temporal reasoning, their use in TFL is hindered by ambiguous boundaries, sparse forgeries, and limited long-range modeling. We propose DeformTrace, which enhances SSMs with deformable dynamics and relay mechanisms to address these challenges. Specifically, Deformable Self-SSM (DS-SSM) introduces dynamic receptive fields into SSMs for precise temporal localization. To further enhance its capacity for temporal reasoning and mitigate long-range decay, a Relay Token Mechanism is integrated into DS-SSM. Besides, Deformable Cross-SSM (DC-SSM) partitions the global state space into query-specific subspaces, reducing non-forgery information accumulation and boosting sensitivity to sparse forgeries. These components are integrated into a hybrid architecture that combines the global modeling of Transformers with the efficiency of SSMs. Extensive experiments show that DeformTrace achieves state-of-the-art performance with fewer parameters, faster inference, and stronger robustness.

Architecture Design (Transformers, SSMs, MoE)Computer Vision Speech & Audio

Citation Metrics

Citations0

Influential citations0

References41

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

DeformTrace: A Deformable State Space Model with Relay Tokens for Temporal Forgery Localization

Related Papers