Search papers, labs, and topics across Lattice.
This paper introduces EntroAD, a zero-shot anomaly detection framework that uses structural entropy to dynamically route image patches to specialized adaptation strategies. EntroAD estimates patch-level structural entropy from self-attention to guide anomaly-aware token routing, enabling the model to handle diverse anomaly characteristics. Experiments across 10 datasets demonstrate state-of-the-art performance in cross-dataset zero-shot anomaly detection.
Achieve SOTA zero-shot anomaly detection by dynamically routing image patches based on structural entropy, adapting to heterogeneous anomaly patterns without target-domain fine-tuning.
Zero-Shot Anomaly Detection (ZSAD) aims to detect anomalies in unseen domains without target-domain adaptation. Recent CLIP-based methods have shown promising performance by leveraging prompt learning and visual-text alignment. However, most existing approaches rely on a single adaptation pathway, which may be insufficient for heterogeneous anomaly patterns across domains. In practice, anomalies exhibit vastly different characteristics, ranging from salient, localized structural disruptions to subtle, diffuse, and irregular variations. To address this challenge, we propose EntroAD, a structural entropy-guided zero-shot anomaly detection framework. Unlike previous methods, EntroAD introduces a dynamic routing mechanism to process different types of anomalies with specialized adaptation strategies. Specifically, we estimate patch-level structural entropy from self-attention-induced patch relations and use it as a proxy for relational uncertainty to guide anomaly-aware token routing. Based on this routing signal, we construct anomaly-aware routed tokens to better capture anomaly cues with different structural characteristics. We further introduce a confidence-aware dual-branch prompt adaptation module to stabilize visual-text alignment while preserving CLIP's transferable prior. Extensive experiments on 10 industrial and medical benchmarks show that EntroAD achieves state-of-the-art performance in challenging cross-dataset ZSAD settings.