Mar 12, 2026arXiv:2603.12008

CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation

Ziqi Ye, Ziyang Gong, Ning Liao, Xiaoxing Hu, Di Wang, Hongruixuan Chen, Chen Huang, Yuru Jia, Xiaoxing Wang, Haipeng Wang, Junchi Yan

AI Summary

CrossEarth-SAR, a billion-scale SAR vision foundation model, was developed using a novel physics-guided sparse mixture-of-experts (MoE) architecture to address domain shifts in SAR imagery for semantic segmentation. The model was pre-trained on the newly created CrossEarth-SAR-200K dataset and evaluated on a benchmark suite of 22 sub-benchmarks spanning 8 domain gaps. Results show CrossEarth-SAR achieves state-of-the-art performance, exceeding previous methods by over 10% mIoU on certain benchmarks under multi-gap transfer, demonstrating strong domain generalization capabilities.

Key Contribution

A billion-scale SAR foundation model closes the domain generalization gap in SAR imagery by 10% mIoU, thanks to a physics-guided MoE architecture.

Abstract

Synthetic Aperture Radar (SAR) enables global, all-weather earth observation. However, owing to diverse imaging mechanisms, domain shifts across sensors and regions severely hinder its semantic generalization. To address this, we present CrossEarth-SAR, the first billion-scale SAR vision foundation model built upon a novel physics-guided sparse mixture-of-experts (MoE) architecture incorporating physical descriptors, explicitly designed for cross-domain semantic segmentation. To facilitate large-scale pre-training, we develop CrossEarth-SAR-200K, a weakly and fully supervised dataset that unifies public and private SAR imagery. We also introduce a benchmark suite comprising 22 sub-benchmarks across 8 distinct domain gaps, establishing the first unified standard for domain generalization semantic segmentation on SAR imagery. Extensive experiments demonstrate that CrossEarth-SAR achieves state-of-the-art results on 20 benchmarks, surpassing previous methods by over 10\% mIoU on some benchmarks under multi-gap transfer. All code, benchmark and datasets will be publicly available.

Architecture Design (Transformers, SSMs, MoE)Computer Vision Data Curation & Synthetic Data

Citation Metrics

Citations0

Influential citations0

References68

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation

Related Papers