This paper introduces Cross Pseudo Labeling for Video Anomaly Detection (CPL-VAD), a dual-branch framework for weakly supervised video anomaly detection that uses only video-level labels. CPL-VAD pairs a binary anomaly detection branch, which localizes anomalies at the snippet level, with a category classification branch, which uses vision-language alignment to recognize abnormal event categories. The two branches exchange pseudo labels to combine temporal precision with semantic discrimination, and the paper reports state-of-the-art performance on the XD-Violence and UCF-Crime datasets.
Cross pseudo labeling between anomaly detection and category classification branches substantially boosts weakly supervised video anomaly detection, achieving state-of-the-art results.
Weakly supervised video anomaly detection aims to detect anomalies and identify abnormal categories with only video-level labels. We propose CPL-VAD, a dual-branch framework with cross pseudo labeling. The binary anomaly detection branch focuses on snippet-level anomaly localization, while the category classification branch leverages vision-language alignment to recognize abnormal event categories. By exchanging pseudo labels, the two branches transfer complementary strengths, combining temporal precision with semantic discrimination. Experiments on XD-Violence and UCF-Crime demonstrate that CPL-VAD achieves state-of-the-art performance in both anomaly detection and abnormal category classification.
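The abstract does not specify how the pseudo labels are exchanged, so the following is only a minimal sketch of one plausible scheme, not the authors' method. It assumes that each branch thresholds its own snippet-level scores to produce training targets for the other branch. The function name `cross_pseudo_labels`, the fixed `threshold`, and the convention that category index 0 means "normal" are all hypothetical choices made for illustration.

```python
import numpy as np


def cross_pseudo_labels(anomaly_scores, class_probs, video_label, threshold=0.5):
    """Illustrative (hypothetical) exchange of snippet-level pseudo labels.

    anomaly_scores: shape (T,), binary-branch anomaly score per snippet in [0, 1].
    class_probs:    shape (T, C), classification-branch probabilities per snippet;
                    column 0 is assumed to be the "normal" class.
    video_label:    video-level category index (0 = normal video).

    Returns (category_targets, binary_targets), each of shape (T,).
    """
    anomaly_scores = np.asarray(anomaly_scores, dtype=float)
    class_probs = np.asarray(class_probs, dtype=float)
    num_snippets = anomaly_scores.shape[0]

    if video_label == 0:
        # A normal video contains no anomalous snippets for either branch.
        zeros = np.zeros(num_snippets, dtype=int)
        return zeros, zeros.copy()

    # Binary branch -> classification branch: snippets the detector flags
    # inherit the video-level category; the rest are treated as normal.
    category_targets = np.where(anomaly_scores > threshold, video_label, 0)

    # Classification branch -> binary branch: snippets that the classifier
    # confidently assigns to the video-level category become anomalous targets.
    binary_targets = (class_probs[:, video_label] > threshold).astype(int)

    return category_targets, binary_targets


# Toy example: 4 snippets, 3 classes (0 = normal), video labeled as category 2.
scores = [0.1, 0.9, 0.7, 0.2]
probs = [
    [0.8, 0.1, 0.1],
    [0.1, 0.2, 0.7],
    [0.5, 0.1, 0.4],
    [0.2, 0.1, 0.7],
]
category_targets, binary_targets = cross_pseudo_labels(scores, probs, video_label=2)
```

In this toy example the two branches disagree on the last two snippets, which is the situation where exchanging targets can transfer information in either direction. A real system would likely use soft or confidence-weighted targets rather than a single hard threshold.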