Search papers, labs, and topics across Lattice.
CrossEarth-SAR, a billion-scale SAR vision foundation model, was developed using a novel physics-guided sparse mixture-of-experts (MoE) architecture to address domain shifts in SAR imagery for semantic segmentation. The model was pre-trained on the newly created CrossEarth-SAR-200K dataset and evaluated on a benchmark suite of 22 sub-benchmarks spanning 8 domain gaps. Results show CrossEarth-SAR achieves state-of-the-art performance, exceeding previous methods by over 10% mIoU on certain benchmarks under multi-gap transfer, demonstrating strong domain generalization capabilities.
A billion-scale SAR foundation model closes the domain generalization gap in SAR imagery by 10% mIoU, thanks to a physics-guided MoE architecture.
Synthetic Aperture Radar (SAR) enables global, all-weather earth observation. However, owing to diverse imaging mechanisms, domain shifts across sensors and regions severely hinder its semantic generalization. To address this, we present CrossEarth-SAR, the first billion-scale SAR vision foundation model built upon a novel physics-guided sparse mixture-of-experts (MoE) architecture incorporating physical descriptors, explicitly designed for cross-domain semantic segmentation. To facilitate large-scale pre-training, we develop CrossEarth-SAR-200K, a weakly and fully supervised dataset that unifies public and private SAR imagery. We also introduce a benchmark suite comprising 22 sub-benchmarks across 8 distinct domain gaps, establishing the first unified standard for domain generalization semantic segmentation on SAR imagery. Extensive experiments demonstrate that CrossEarth-SAR achieves state-of-the-art results on 20 benchmarks, surpassing previous methods by over 10\% mIoU on some benchmarks under multi-gap transfer. All code, benchmark and datasets will be publicly available.