Search papers, labs, and topics across Lattice.
This paper introduces MAPLE, a novel framework for hierarchical multi-label classification (HMLC) in remote sensing that tackles the challenge of multi-path label activation. MAPLE integrates graph-aware textual descriptions for semantic initialization, graph convolutional networks (GCNs) for structure encoding, and adaptive multi-modal fusion to balance semantic priors and visual evidence. Experiments on remote sensing datasets demonstrate MAPLE's effectiveness, achieving up to a 42% improvement in few-shot regimes with minimal parameter overhead.
Achieve massive gains in few-shot hierarchical multi-label classification (+42%) by adaptively balancing semantic priors and visual evidence using level-aware embeddings.
Hierarchical multi-label classification (HMLC) is essential for modeling structured label dependencies in remote sensing. Yet existing approaches struggle in multi-path settings, where images may activate multiple taxonomic branches, leading to underuse of hierarchical information. We propose MAPLE (Multi-Path Adaptive Propagation with Level-Aware Embeddings), a framework that integrates (i) hierarchical semantic initialization from graph-aware textual descriptions, (ii) graph-based structure encoding via graph convolutional networks (GCNs), and (iii) adaptive multi-modal fusion that dynamically balances semantic priors and visual evidence. An adaptive level-aware objective automatically selects appropriate losses per hierarchy level. Evaluations on CORINE-aligned remote sensing datasets (AID, DFC-15, and MLRSNet) show consistent improvements of up to +42% in few-shot regimes while adding only 2.6% parameter overhead, demonstrating that MAPLE effectively and efficiently models hierarchical semantics for Earth observation (EO).