This work presents EarthMind, a novel vision-language framework for multi-granular and multi-sensor EO data understanding, which outperforms existing methods on multiple public EO benchmarks and showcases its potential to handle both multi-granular and multi-sensor challenges in a unified framework.
Pixel-perfect geospatial reasoning is now possible, thanks to a vision-language model that adaptively fuses multi-modal and multi-temporal Earth observation data.
EarthMind demonstrates that hierarchical cross-modal attention across optical and SAR data significantly boosts MLLM performance on Earth Observation tasks, outperforming models limited to single-sensor inputs.
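The cross-modal attention idea mentioned above can be illustrated with a minimal sketch: queries from one sensor's token embeddings attend over the other sensor's tokens, producing fused features. This is a generic scaled dot-product cross-attention in NumPy, not EarthMind's actual implementation; the token shapes, the single-head form, and the variable names (`optical`, `sar`) are illustrative assumptions.

```python
import numpy as np

def softmax(x, axis=-1):
    # numerically stable softmax
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def cross_attention(queries, keys_values, d_k):
    # queries from one modality attend over the tokens of the other;
    # a toy single-head version with no learned projections
    scores = queries @ keys_values.T / np.sqrt(d_k)
    weights = softmax(scores, axis=-1)       # each row sums to 1
    return weights @ keys_values

rng = np.random.default_rng(0)
d = 8
optical = rng.normal(size=(16, d))   # toy optical patch embeddings
sar = rng.normal(size=(16, d))       # toy SAR patch embeddings

# optical queries attend over SAR tokens, yielding SAR-informed features
fused = cross_attention(optical, sar, d)
print(fused.shape)  # (16, 8)
```

In a hierarchical variant, such fusion would be applied at several feature resolutions rather than once, but the core attention operation is the same.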