Mar 4, 2026arXiv:2603.03983

GeoSeg: Training-Free Reasoning-Driven Segmentation in Remote Sensing Imagery

Lifan Jiang, Yuhang Pei, oxi Wu, Yan Zhao, Tianrun Wu, Shulong Yu, Lihui Zhang, Deng Cai

AI Summary

GeoSeg is introduced as a training-free framework for reasoning-driven remote sensing image segmentation, addressing the lack of generalizable solutions due to data costs and domain-specific challenges. It combines MLLM reasoning with precise localization through bias-aware coordinate refinement and a dual-route prompting mechanism. Evaluated on the newly introduced GeoSeg-Bench benchmark, GeoSeg outperforms existing baselines, demonstrating the efficacy of its components.

Key Contribution

Achieve zero-shot remote sensing image segmentation by cleverly fusing MLLM reasoning with geometric refinement, sidestepping the need for costly training data.

Abstract

Recent advances in MLLMs are reframing segmentation from fixed-category prediction to instruction-grounded localization. While reasoning based segmentation has progressed rapidly in natural scenes, remote sensing lacks a generalizable solution due to the prohibitive cost of reasoning-oriented data and domain-specific challenges like overhead viewpoints. We present GeoSeg, a zero-shot, training-free framework that bypasses the supervision bottleneck for reasoning-driven remote sensing segmentation. GeoSeg couples MLLM reasoning with precise localization via: (i) bias-aware coordinate refinement to correct systematic grounding shifts and (ii) a dual-route prompting mechanism to fuse semantic intent with fine-grained spatial cues. We also introduce GeoSeg-Bench, a diagnostic benchmark of 810 image--query pairs with hierarchical difficulty levels. Experiments show that GeoSeg consistently outperforms all baselines, with extensive ablations confirming the effectiveness and necessity of each component.

Computer Vision Multimodal Models Reasoning & Chain-of-Thought

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

GeoSeg: Training-Free Reasoning-Driven Segmentation in Remote Sensing Imagery

Related Papers