Search papers, labs, and topics across Lattice.
This paper identifies a critical prediction asymmetry in AI-generated image detectors, where a bias towards real images limits their sensitivity to generated content, particularly after standard post-processing like compression. To counteract this, the authors introduce DEAR (Dissect and Prune), a method that utilizes inpainted images to identify and eliminate spurious features that obscure true generative artifacts. Experimental results show that DEAR significantly improves robustness against unseen generators and post-processing, effectively addressing the prediction asymmetry issue.
AI-generated image detectors can be misled by spurious features, but DEAR prunes these distractions to enhance detection robustness against post-processing.
While existing AI-generated image detectors report high performance, we identify that this is largely driven by a critical prediction asymmetry: a bias toward the real class that severely limits sensitivity to generated content, especially under standard post-processing operations such as compression and resizing. We hypothesize that this stems from the model's reliance on spurious features, distracting signals that obscure true generative artifacts. To address this, we propose DEAR (Dissect and Prune), which leverages inpainted images to identify and prune these interfering components. Specifically, we find that features strongly aligned to either inpainted or non-inpainted regions are less robust to post-processing. By measuring the alignment between channel activations and inpaint masks, DEAR removes features at both extremes, retaining only those that capture genuine generative artifacts. Experimental results demonstrate that our approach significantly enhances robustness against unseen generators and post-processing, effectively mitigating the prediction asymmetry. Our code is available at https://github.com/dahyedahye/dear.