Search papers, labs, and topics across Lattice.
The paper introduces ZSG-IAD, a multimodal vision-language framework for zero-shot industrial anomaly detection that generates anomaly reports and pixel-level anomaly masks from RGB, sensor, and 3D point cloud data. A key component is a language-guided two-hop grounding module that selects evidence-like latent slots from multimodal features to produce coarse spatial support, which is then refined into fine-grained masks using channel-spatial gating. The framework further incorporates Executable-Rule GRPO with verifiable rewards to improve the reliability and structure of the generated outputs, demonstrating strong zero-shot performance and more transparent explanations on industrial anomaly benchmarks.
Achieve explainable zero-shot industrial anomaly detection by grounding multimodal data with language and structured rules, offering a path to more trustworthy AI in manufacturing.
Deep learning-based industrial anomaly detectors often behave as black boxes, making it hard to justify decisions with physically meaningful defect evidence. We propose ZSG-IAD, a multimodal vision-language framework for zero-shot grounded industrial anomaly detection. Given RGB images, sensor images, and 3D point clouds, ZSG-IAD generates structured anomaly reports and pixel-level anomaly masks. ZSG-IAD introduces a language-guided two-hop grounding module: (1) anomaly-related sentences select evidence-like latent slots distilled from multimodal features, yielding coarse spatial support; (2) selected slots modulate feature maps via channel-spatial gating and a lightweight decoder to produce fine-grained masks. To improve reliability, we further apply Executable-Rule GRPO with verifiable rewards to promote structured outputs, anomaly-region consistency, and reasoning-conclusion coherence. Experiments across multiple industrial anomaly benchmarks show strong zero-shot performance and more transparent, physically grounded explanations than prior methods. We will release code and annotations to support future research on trustworthy industrial anomaly detection systems.