Apr 21, 2026arXiv:2604.19431

Counting Worlds Branching Time Semantics for post-hoc Bias Mitigation in generative AI

Alessandro G. Buda, Giuseppe Primiero, Leonardo Ceragioli, Melissa Antonelli

AI Summary

This paper introduces CTLF, a branching-time logic with counting worlds semantics, to formally reason about and mitigate bias amplification in generative AI outputs. CTLF enables verification of whether an output series respects a desired probability distribution over protected attributes and predicts the likelihood of remaining within acceptable fairness bounds as generation progresses. The framework also determines the number of outputs needed to remove to restore fairness, demonstrated on a biased image generation example.

Key Contribution

Get formal guarantees on fairness in generative AI by reasoning about possible output sequences, not just individual generations.

Abstract

Generative AI systems are known to amplify biases present in their training data. While several inference-time mitigation strategies have been proposed, they remain largely empirical and lack formal guarantees. In this paper we introduce CTLF, a branching-time logic designed to reason about bias in series of generative AI outputs. CTLF adopts a counting worlds semantics where each world represents a possible output at a given step in the generation process and introduces modal operators that allow us to verify whether the current output series respects an intended probability distribution over a protected attribute, to predict the likelihood of remaining within acceptable bounds as new outputs are generated, and to determine how many outputs are needed to remove in order to restore fairness. We illustrate the framework on a toy example of biased image generation, showing how CTLF formulas can express concrete fairness properties at different points in the output series.

Constitutional AI & AI Ethics Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Counting Worlds Branching Time Semantics for post-hoc Bias Mitigation in generative AI

Related Papers