IISc · Feb 25, 2026 · arXiv:2602.22120

GeoDiv: Framework For Measuring Geographical Diversity In Text-To-Image Models

Abhipsa Basu, Mohana Singh, Shashank Agnihotri, Margret Keuper, R. Venkatesh Babu

AI Summary

The paper introduces GeoDiv, a framework that evaluates geographical diversity in text-to-image models by analyzing generated images along two axes: the Socio-Economic Visual Index (SEVI) and the Visual Diversity Index (VDI). GeoDiv uses large language models and vision-language models to identify biases in generated images across countries and entities. Experiments on Stable Diffusion and FLUX.1-dev reveal a lack of diversity and a tendency toward biased portrayals: countries such as India, Nigeria, and Colombia are often depicted with impoverished attributes.

Key Contribution

Text-to-image models consistently depict countries like India and Nigeria as disproportionately impoverished, revealing stark socio-economic biases that GeoDiv can now systematically measure.

Abstract

Text-to-image (T2I) models are rapidly gaining popularity, yet their outputs often lack geographical diversity, reinforce stereotypes, and misrepresent regions. Given their broad reach, it is critical to rigorously evaluate how these models portray the world. Existing diversity metrics either rely on curated datasets or focus on surface-level visual similarity, limiting interpretability. We introduce GeoDiv, a framework leveraging large language and vision-language models to assess geographical diversity along two complementary axes: the Socio-Economic Visual Index (SEVI), capturing economic and condition-related cues, and the Visual Diversity Index (VDI), measuring variation in primary entities and backgrounds. Applied to images generated by models such as Stable Diffusion and FLUX.1-dev across 10 entities and 16 countries, GeoDiv reveals a consistent lack of diversity and identifies fine-grained attributes where models default to biased portrayals. Strikingly, depictions of countries like India, Nigeria, and Colombia are disproportionately impoverished and worn, reflecting underlying socio-economic biases. These results highlight the need for greater geographical nuance in generative models. GeoDiv provides the first systematic, interpretable framework for measuring such biases, marking a step toward fairer and more inclusive generative systems. Project page: https://abhipsabasu.github.io/geodiv
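The abstract does not state how SEVI or VDI are computed, so the following is only an illustrative sketch of one common way to score variation in categorical answers: normalized Shannon entropy. It assumes a vision-language model has already answered one multiple-choice question per generated image (for example, the wall material of a house). The function name `normalized_entropy`, the question, and the answer lists are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def normalized_entropy(answers, num_options):
    """Shannon entropy of the answer distribution, scaled to [0, 1].

    answers: one categorical answer per image (hypothetical VLM output).
    num_options: number of answer choices the question allows.
    This is an assumed stand-in for a diversity score, not the paper's metric.
    """
    if num_options < 2 or not answers:
        return 0.0
    total = len(answers)
    counts = Counter(answers)
    # Sum p * log(1/p) over the answers that actually occur.
    entropy = sum((c / total) * math.log(total / c) for c in counts.values())
    # Divide by the maximum possible entropy, log(num_options).
    return entropy / math.log(num_options)


# Hypothetical answers for 20 images of "a house" in two settings.
varied = ["brick", "wood", "concrete", "glass"] * 5
collapsed = ["brick"] * 20

varied_score = normalized_entropy(varied, num_options=4)
collapsed_score = normalized_entropy(collapsed, num_options=4)
print(varied_score, collapsed_score)
```

Under this assumed scoring, a score near 1 means the images spread evenly over the allowed answers, while a score near 0 means the model keeps producing the same attribute value, which is the kind of default portrayal the abstract describes.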

Topics: Constitutional AI & AI Ethics; Eval Frameworks & Benchmarks; Multimodal Models

