Search papers, labs, and topics across Lattice.
This paper addresses the challenge of detecting and explaining localized manipulations in untrimmed long videos, a scenario inadequately handled by existing forensic methods that focus on short clips. The authors introduce TASLE, a comprehensive benchmark dataset with 12,472 videos annotated for authenticity, temporal boundaries, and segment-level rationales, alongside a novel forensic approach called MSLoc that enhances detection accuracy through a coarse-to-fine strategy. Experimental results demonstrate that segment-level explainability significantly improves the analysis of AI-generated content, underscoring the need for advanced forensic techniques in the age of misinformation.
Segment-level explainable forensics can drastically enhance our ability to detect and interpret localized manipulations in lengthy AI-generated videos.
The rapid advancement of AI-driven video generation has transformed content creation, while simultaneously increasing the risk of misinformation through localized manipulations in long-form videos. Existing video forensic methods predominantly operate on short, independent clips, and thus fail to capture realistic scenarios where AI-generated content is sparsely embedded within otherwise authentic footage. To bridge this gap, we formulate the task of Temporal AI-Generated Segment Localization and Explanation, which targets authenticity detection, temporal localization, and interpretable analysis of manipulated segments in untrimmed long videos. We further introduce TASLE, a large-scale benchmark comprising 12,472 untrimmed videos with diverse manipulation patterns and rich annotation signals, including temporal boundaries, authenticity labels, and segment-level rationales. In addition, we propose MSLoc, a coarse-to-fine forensic baseline that combines a boundary-sensitive proposal generation module for efficient long-video scanning with an MLLM-based refinement module for precise boundary localization and interpretable reasoning. Experiments validate the effectiveness of the proposed baseline, highlighting the importance of segment-level explainable forensics for long-form AI-generated video analysis. Our dataset and code are publicly available at https://debby-0527.github.io/TASLE.