Search papers, labs, and topics across Lattice.
This study investigates identity memorization in text-to-image (T2I) models by introducing a novel black-box behavioral probe that can differentiate between memorized and fabricated faces without requiring access to training data or reference images. The researchers benchmark their approach using the NAMESAKES dataset, which includes over a thousand names and faces of public figures, revealing significant insights into how different T2I models handle identity recognition. Key findings indicate that the probe effectively predicts identity memorization and distinguishes between recognized and unrecognized names, highlighting important variances across model architectures.
T2I models can be effectively probed for identity memorization without any access to training data, revealing surprising differences in how they handle famous versus less recognized names.
Text-to-image (T2I) models generate realistic likenesses of some individuals when prompted with their names, raising privacy concerns. However, distinguishing whether a generated face is memorized or fabricated currently requires ground-truth photos, access to training data, or white-box access to model internals, limiting applicability. We introduce a fully black-box behavioral probe that distinguishes between these regimes while requiring no reference photos or prior knowledge of training data. To benchmark this task, we present the NAMESAKES dataset of over one thousand names and faces of public figures spanning a wide range of fame levels, along with perturbed, less famous names. Experiments on state-of-the-art T2I models show that our probe substantially predicts identity memorization and separates memorized from unrecognized names, with further insights into differences across model families.