CMU ML | Cornell | TAU | Jun 18, 2026 | arXiv:2606.20155

NAMESAKES: Probing Identity Memorization in Text-to-Image Models

Morris Alper, Vasudha Varadarajan, Moran Yanuka, Angelina Wang, Hadar Averbuch-Elor

AI Summary

This study investigates identity memorization in text-to-image (T2I) models by introducing a fully black-box behavioral probe that distinguishes memorized faces from fabricated ones without access to training data, reference photos, or model internals. The authors benchmark the probe on the NAMESAKES dataset, which contains over one thousand names and faces of public figures spanning a wide range of fame levels, along with perturbed, less famous names. On state-of-the-art T2I models, the probe substantially predicts identity memorization and separates memorized from unrecognized names, and the results also show differences across model families.

Key Contribution

T2I models can be probed for identity memorization with black-box access alone, using no reference photos or training data, and the probe separates memorized names from unrecognized ones.

Abstract

Text-to-image (T2I) models generate realistic likenesses of some individuals when prompted with their names, raising privacy concerns. However, distinguishing whether a generated face is memorized or fabricated currently requires ground-truth photos, access to training data, or white-box access to model internals, limiting applicability. We introduce a fully black-box behavioral probe that distinguishes between these regimes while requiring no reference photos or prior knowledge of training data. To benchmark this task, we present the NAMESAKES dataset of over one thousand names and faces of public figures spanning a wide range of fame levels, along with perturbed, less famous names. Experiments on state-of-the-art T2I models show that our probe substantially predicts identity memorization and separates memorized from unrecognized names, with further insights into differences across model families.

Computer Vision, Constitutional AI & AI Ethics, Multimodal Models


Year: 2026

Venue: N/A
