Apr 2, 2026arXiv:2604.01987

Curia-2: Scaling Self-Supervised Learning for Radiology Foundation Models

Antoine Saporta, A. Saporta, Baptiste Callard, Corentin Dancette, Julien Khlaut, Charles Corbière, L'eo Butsanets, Amaury Prat, P. Manceron, Pierre Manceron

AI Summary

Curia-2 enhances radiology foundation models by refining the original Curia pre-training strategy to better capture radiological data specificities, enabling the scaling of architectures to billion-parameter Vision Transformers for multi-modal CT and MRI data. They formalized evaluation by extending CuriaBench into 2D and 3D tracks for slice-based and volumetric benchmarking, respectively. Results show Curia-2 outperforms all vision-focused FMs and competes with vision-language models on complex clinical tasks.

Key Contribution

Billion-parameter Vision Transformers can now be effectively pre-trained on multi-modal CT and MRI data, achieving state-of-the-art performance on vision-focused radiology tasks.

Abstract

The rapid growth of medical imaging has fueled the development of Foundation Models (FMs) to reduce the growing, unsustainable workload on radiologists. While recent FMs have shown the power of large-scale pre-training to CT and MRI analysis, there remains significant room to optimize how these models learn from complex radiological volumes. Building upon the Curia framework, this work introduces Curia-2, which significantly improves the original pre-training strategy and representation quality to better capture the specificities of radiological data. The proposed methodology enables scaling the architecture up to billion-parameter Vision Transformers, marking a first for multi-modal CT and MRI FMs. Furthermore, we formalize the evaluation of these models by extending and restructuring CuriaBench into two distinct tracks: a 2D track tailored for slice-based vision models and a 3D track for volumetric benchmarking. Our results demonstrate that Curia-2 outperforms all FMs on vision-focused tasks and fairs competitively to vision-language models on clinically complex tasks such as finding detection. Weights will be made publicly available to foster further research.

Computer Vision Data Curation & Synthetic Data Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References24

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Curia-2: Scaling Self-Supervised Learning for Radiology Foundation Models

Related Papers