Search papers, labs, and topics across Lattice.
This paper introduces a training-free zero-shot anomaly detection (ZSAD) framework for 3D brain MRI by aggregating multi-axis 2D slices processed by foundation models into localized volumetric tokens. By constructing 3D patch tokens from 2D foundation model features, the method captures cubic spatial context without requiring fine-tuning or supervision. Experiments demonstrate the effectiveness of extending training-free, batch-based ZSAD from 2D encoders to full 3D MRI volumes.
Turns out you can get surprisingly good 3D medical anomaly detection without any training, just by cleverly aggregating 2D foundation model features.
Zero-shot anomaly detection (ZSAD) has gained increasing attention in medical imaging as a way to identify abnormalities without task-specific supervision, but most advances remain limited to 2D datasets. Extending ZSAD to 3D medical images has proven challenging, with existing methods relying on slice-wise features and vision-language models, which fail to capture volumetric structure. In this paper, we introduce a fully training-free framework for ZSAD in 3D brain MRI that constructs localized volumetric tokens by aggregating multi-axis slices processed by 2D foundation models. These 3D patch tokens restore cubic spatial context and integrate directly with distance-based, batch-level anomaly detection pipelines. The framework provides compact 3D representations that are practical to compute on standard GPUs and require no fine-tuning, prompts, or supervision. Our results show that training-free, batch-based ZSAD can be effectively extended from 2D encoders to full 3D MRI volumes, offering a simple and robust approach for volumetric anomaly detection.