Feb 25, 2026arXiv:2602.21509

Fair Model-based Clustering

Jinwon Park, Kunwoong Kim, Jihu Lee, Yongdai Kim

AI Summary

This paper introduces Fair Model-based Clustering (FMC), a novel fair clustering algorithm based on a finite mixture model, addressing the scalability limitations of existing fair K-means variants. FMC decouples the number of parameters from the sample size, enabling mini-batch learning for approximate fairness and applicability to non-metric data. Experiments and theoretical analysis demonstrate FMC's superior scalability and performance compared to existing methods.

Key Contribution

Fair clustering just got a whole lot faster: FMC achieves fairness without the computational bottleneck of sample-dependent parameters.

Abstract

The goal of fair clustering is to find clusters such that the proportion of sensitive attributes (e.g., gender, race, etc.) in each cluster is similar to that of the entire dataset. Various fair clustering algorithms have been proposed that modify standard K-means clustering to satisfy a given fairness constraint. A critical limitation of several existing fair clustering algorithms is that the number of parameters to be learned is proportional to the sample size because the cluster assignment of each datum should be optimized simultaneously with the cluster center, and thus scaling up the algorithms is difficult. In this paper, we propose a new fair clustering algorithm based on a finite mixture model, called Fair Model-based Clustering (FMC). A main advantage of FMC is that the number of learnable parameters is independent of the sample size and thus can be scaled up easily. In particular, mini-batch learning is possible to obtain clusters that are approximately fair. Moreover, FMC can be applied to non-metric data (e.g., categorical data) as long as the likelihood is well-defined. Theoretical and empirical justifications for the superiority of the proposed algorithm are provided.

Constitutional AI & AI Ethics Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Fair Model-based Clustering

Related Papers