Feb 23, 2026 · arXiv:2602.20066

HeatPrompt: Zero-Shot Vision-Language Modeling of Urban Heat Demand from Satellite Images

Kundan Thota, Xuanhao Mu, Thorsten Schlachter, Veit Hagenmeyer

AI Summary

The paper introduces HeatPrompt, a zero-shot vision-language framework for estimating building-level annual heat demand using satellite imagery and limited GIS data. HeatPrompt leverages a domain-specific prompt to guide a pretrained VLM in extracting semantic features from satellite images relevant to thermal load. An MLP regressor trained on the VLM-generated captions achieves a 93.7% $R^2$ uplift and a 30% MAE reduction compared to a baseline model, demonstrating the efficacy of the approach.

Key Contribution

Skip expensive building-level data collection: HeatPrompt uses satellite images and vision-language models to accurately map urban heat demand.

Abstract

Accurate heat-demand maps play a crucial role in decarbonizing space heating, yet most municipalities lack the detailed building-level data needed to calculate them. We introduce HeatPrompt, a zero-shot vision-language energy modeling framework that estimates annual heat demand using semantic features extracted from satellite images, basic Geographic Information System (GIS) data, and building-level features. We give pretrained Large Vision Language Models (VLMs) a domain-specific prompt that instructs them to act as an energy planner and to extract, from the RGB satellite image, the visual attributes that correspond to the thermal load, such as roof age and building density. A Multi-Layer Perceptron (MLP) regressor trained on these captions shows an $R^2$ uplift of 93.7% and reduces the mean absolute error (MAE) by 30% compared to the baseline model. Qualitative analysis shows that high-impact tokens align with high-demand zones, offering lightweight support for heat planning in data-scarce regions.
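The abstract reports its results as an $R^2$ uplift and an MAE reduction relative to a baseline. The sketch below shows one way such figures could be computed, assuming both are relative changes with respect to the baseline value (the abstract does not state the exact definition). The helper names and the toy numbers are hypothetical and are not taken from the paper.

```python
from statistics import fmean


def mae(y_true, y_pred):
    """Mean absolute error between targets and predictions."""
    return fmean(abs(t - p) for t, p in zip(y_true, y_pred))


def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_true = fmean(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot


def relative_change(new, old):
    """Relative change of `new` with respect to `old`, as a fraction."""
    return (new - old) / abs(old)


# Hypothetical toy data (NOT from the paper): annual heat demand in MWh.
y_true = [100.0, 150.0, 200.0, 250.0, 300.0]
baseline_pred = [140.0, 130.0, 240.0, 210.0, 340.0]  # stand-in baseline model
caption_pred = [110.0, 140.0, 210.0, 240.0, 310.0]   # stand-in caption-based model

# Assumption: "uplift" and "reduction" are relative to the baseline score.
r2_uplift = relative_change(r2(y_true, caption_pred), r2(y_true, baseline_pred))
mae_reduction = -relative_change(mae(y_true, caption_pred), mae(y_true, baseline_pred))

print(f"R^2 uplift:    {r2_uplift:.1%}")
print(f"MAE reduction: {mae_reduction:.1%}")
```

Under this reading, an $R^2$ uplift of 93.7% would mean the caption-based model's $R^2$ is 1.937 times the baseline's, and a 30% MAE reduction would mean its MAE is 0.7 times the baseline's.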

Computer Vision, Multimodal Models, Natural Language Processing

Year: 2026

Venue: N/A
