Search papers, labs, and topics across Lattice.
LAA-X, a new deepfake detection framework, uses explicit attention guided by auxiliary tasks within a multi-task learning setup to focus on artifact-prone regions. This approach enhances generalization to unseen manipulations by training on real and pseudo-fake samples, unlike existing methods that rely on binary classifiers and implicit attention. Experiments show LAA-X achieves state-of-the-art performance across multiple benchmarks despite not being trained on real forgeries.
Spot deepfakes like a pro: LAA-X uses explicit attention to find tell-tale artifacts, even in high-quality forgeries and unseen manipulations.
In this paper, we propose Localized Artifact Attention X (LAA-X), a novel deepfake detection framework that is both robust to high-quality forgeries and capable of generalizing to unseen manipulations. Existing approaches typically rely on binary classifiers coupled with implicit attention mechanisms, which often fail to generalize beyond known manipulations. In contrast, LAA-X introduces an explicit attention strategy based on a multi-task learning framework combined with blending-based data synthesis. Auxiliary tasks are designed to guide the model toward localized, artifact-prone (i.e., vulnerable) regions. The proposed framework is compatible with both CNN and transformer backbones, resulting in two different versions, namely, LAA-Net and LAA-Former, respectively. Despite being trained only on real and pseudo-fake samples, LAA-X competes with state-of-the-art methods across multiple benchmarks. Code and pre-trained weights for LAA-Net\footnote{https://github.com/10Ring/LAA-Net} and LAA-Former\footnote{https://github.com/10Ring/LAA-Former} are publicly available.