Tsinghua AIMar 17, 2026arXiv:2603.16142

Parametric Social Identity Injection and Diversification in Public Opinion Simulation

Hexi Wang, Yujia Zhou, Bangde Du, Qingyao Ai, Yiqun Liu

AI Summary

This paper addresses the limitations of large language models (LLMs) in public opinion simulation, specifically their tendency to produce homogeneous responses due to a phenomenon termed Diversity Collapse in hidden representations. To counter this, the authors introduce Parametric Social Identity Injection (PSII), which embeds demographic attributes and value orientations into the LLM's intermediate hidden states, allowing for more nuanced and diverse outputs. Experimental results demonstrate that PSII significantly enhances the fidelity and diversity of simulated opinions, aligning more closely with real-world survey data while reducing KL divergence.

Key Contribution

Injecting demographic attributes directly into LLM hidden states can drastically improve the diversity and realism of public opinion simulations.

Abstract

Large language models (LLMs) have recently been adopted as synthetic agents for public opinion simulation, offering a promising alternative to costly and slow human surveys. Despite their scalability, current LLM-based simulation methods fail to capture social diversity, producing flattened inter-group differences and overly homogeneous responses across demographic groups. We identify this limitation as a Diversity Collapse phenomenon in LLM hidden representations, where distinct social identities become increasingly indistinguishable across layers. Motivated by this observation, we propose Parametric Social Identity Injection (PSII), a general framework that injects explicit, parametric representations of demographic attributes and value orientations directly into intermediate hidden states of LLMs. Unlike prompt-based persona conditioning, PSII enables fine-grained and controllable identity modulation at the representation level. Extensive experiments on the World Values Survey using multiple open-source LLMs show that PSII significantly improves distributional fidelity and diversity, reducing KL divergence to real-world survey data while enhancing overall diversity. This work provides new insights into representation-level control of LLM agents and advances scalable, diversity-aware public opinion simulation.

Constitutional AI & AI Ethics Natural Language Processing World Models & Planning

Citation Metrics

Citations0

Influential citations0

References42

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Parametric Social Identity Injection and Diversification in Public Opinion Simulation

Related Papers