ANITI 2 FranceIMT FranceINRIAIRIT FranceMar 17, 2026arXiv:2603.16749

Probing Cultural Signals in Large Language Models through Author Profiling

Valentin Lafargue, Ariel Guerra-Adames, E. Claeys, Emmanuelle Claeys, Elouan Vuichard, Jean-Michel Loubes, J. Loubes

AI Summary

The study probes cultural biases in LLMs by evaluating their zero-shot author profiling performance (gender and ethnicity prediction) on song lyrics. Results show that LLMs can perform author profiling without fine-tuning, but exhibit systematic cultural alignment biases, with most models defaulting to North American ethnicity. The authors introduce Modality Accuracy Divergence (MAD) and Recall Divergence (RD) metrics to quantify these disparities, revealing that Mistral-8B displays the strongest ethnicity bias.

Key Contribution

LLMs can guess a singer's ethnicity from their lyrics, but they're biased: most default to North American, while DeepSeek-1.5B leans Asian.

Abstract

Large language models (LLMs) are increasingly deployed in applications with societal impact, raising concerns about the cultural biases they encode. We probe these representations by evaluating whether LLMs can perform author profiling from song lyrics in a zero-shot setting, inferring singers'gender and ethnicity without task-specific fine-tuning. Across several open-source models evaluated on more than 10,000 lyrics, we find that LLMs achieve non-trivial profiling performance but demonstrate systematic cultural alignment: most models default toward North American ethnicity, while DeepSeek-1.5B aligns more strongly with Asian ethnicity. This finding emerges from both the models'prediction distributions and an analysis of their generated rationales. To quantify these disparities, we introduce two fairness metrics, Modality Accuracy Divergence (MAD) and Recall Divergence (RD), and show that Ministral-8B displays the strongest ethnicity bias among the evaluated models, whereas Gemma-12B shows the most balanced behavior. Our code is available on GitHub (https://github.com/ValentinLafargue/CulturalProbingLLM).

Constitutional AI & AI Ethics Eval Frameworks & Benchmarks Natural Language Processing

Citation Metrics

Citations0

Influential citations0

References59

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Probing Cultural Signals in Large Language Models through Author Profiling

Related Papers