This paper introduces CTLF, a branching-time logic with counting worlds semantics, for formally reasoning about and mitigating bias amplification in generative AI outputs. CTLF can verify whether an output series respects a desired probability distribution over protected attributes, and can predict the likelihood of remaining within acceptable fairness bounds as generation progresses. The framework also determines how many outputs must be removed to restore fairness, and is demonstrated on a toy example of biased image generation.
Get formal guarantees on fairness in generative AI by reasoning about possible output sequences, not just individual generations.
Generative AI systems are known to amplify biases present in their training data. While several inference-time mitigation strategies have been proposed, they remain largely empirical and lack formal guarantees. In this paper, we introduce CTLF, a branching-time logic designed to reason about bias in series of generative AI outputs. CTLF adopts a counting worlds semantics in which each world represents a possible output at a given step in the generation process. It introduces modal operators that allow us to verify whether the current output series respects an intended probability distribution over a protected attribute, to predict the likelihood of remaining within acceptable bounds as new outputs are generated, and to determine how many outputs must be removed in order to restore fairness. We illustrate the framework on a toy example of biased image generation, showing how CTLF formulas can express concrete fairness properties at different points in the output series.
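The three questions named in the abstract (is the current series within bounds, how likely is it to stay within bounds, and how many outputs must be removed) can be made concrete with a small counting sketch. This is not CTLF's syntax or semantics, which the abstract does not spell out. It is plain arithmetic over category counts, under assumptions that are ours: a tolerance `eps` around each target frequency, a generator that emits categories independently with fixed known probabilities, and hypothetical category labels "A" and "B".

```python
from fractions import Fraction
from itertools import product


def within_bounds(counts, target, eps):
    """True if each category's empirical frequency is within eps of its target."""
    n = sum(counts.values())
    if n == 0:
        return True  # assumption: an empty series is treated as vacuously fair
    return all(abs(Fraction(counts.get(c, 0), n) - target[c]) <= eps for c in target)


def min_removals(counts, target, eps):
    """Fewest outputs to delete so the (non-empty) remainder is within bounds.

    Brute force over every way to remove outputs per category; fine for toy sizes.
    Returns None if no non-empty remainder satisfies the bounds.
    """
    cats = list(target)
    best = None
    for removed in product(*(range(counts.get(c, 0) + 1) for c in cats)):
        remaining = {c: counts.get(c, 0) - r for c, r in zip(cats, removed)}
        if sum(remaining.values()) == 0:
            continue  # assumption: deleting the whole series does not count
        if within_bounds(remaining, target, eps):
            k = sum(removed)
            best = k if best is None else min(best, k)
    return best


def prob_stay_within(counts, gen_probs, steps, target, eps):
    """Probability the series is within bounds after each of the next `steps` outputs.

    Explores the branching tree of possible next outputs, assuming an i.i.d.
    generator with probabilities `gen_probs` (an assumption of this sketch).
    """
    if steps == 0:
        return Fraction(1)
    total = Fraction(0)
    for c, p in gen_probs.items():
        nxt = dict(counts)
        nxt[c] = nxt.get(c, 0) + 1
        if within_bounds(nxt, target, eps):
            total += p * prob_stay_within(nxt, gen_probs, steps - 1, target, eps)
    return total


# Hypothetical toy data: a 50/50 target with a tolerance of 1/10.
target = {"A": Fraction(1, 2), "B": Fraction(1, 2)}
eps = Fraction(1, 10)
skewed = {"A": 7, "B": 3}
balanced = {"A": 5, "B": 5}
biased_generator = {"A": Fraction(7, 10), "B": Fraction(3, 10)}

print(within_bounds(skewed, target, eps))
print(min_removals(skewed, target, eps))
print(prob_stay_within(balanced, biased_generator, 3, target, eps))
```

Exact fractions are used so that borderline cases are not decided by floating-point rounding. The recursion in `prob_stay_within` grows exponentially in `steps`, so it is only suitable for short horizons like the one shown.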