Search papers, labs, and topics across Lattice.
The paper introduces SurFITR, a new dataset designed to address the limitations of existing forgery detection models when applied to surveillance imagery, which often involves subtle and localized manipulations. SurFITR contains over 137k tampered images generated using a multimodal LLM-powered pipeline to simulate realistic surveillance scenarios with varied viewpoints, occlusions, and lower visual quality. Experiments demonstrate that models trained on SurFITR exhibit improved forgery detection and localization performance, both in-domain and in cross-domain settings, compared to models trained on existing datasets.
Current forgery detection models fall apart on realistic surveillance footage, but SurFITR, a new dataset of subtly manipulated surveillance images, closes the gap.
We present the Surveillance Forgery Image Test Range (SurFITR), a dataset for surveillance-style image forgery detection and localisation, in response to recent advances in open-access image generation models that raise concerns about falsifying visual evidence. Existing forgery models, trained on datasets with full-image synthesis or large manipulated regions in object-centric images, struggle to generalise to surveillance scenarios. This is because tampering in surveillance imagery is typically localised and subtle, occurring in scenes with varied viewpoints, small or occluded subjects, and lower visual quality. To address this gap, SurFITR provides a large collection of forensically valuable imagery generated via a multimodal LLM-powered pipeline, enabling semantically aware, fine-grained editing across diverse surveillance scenes. It contains over 137k tampered images with varying resolutions and edit types, generated using multiple image editing models. Extensive experiments show that existing detectors degrade significantly on SurFITR, while training on SurFITR yields substantial improvements in both in-domain and cross-domain performance. SurFITR is publicly available on GitHub.