Search papers, labs, and topics across Lattice.
This paper introduces the AI Sydney corpus, comprising 4.5k texts (6M words) generated by 12 frontier LLMs from OpenAI, Anthropic, Alphabet, DeepSeek, and Meta, simulating three distinct personas (Default, Classic Sydney, and Memetic Sydney) to study AI-human relationships. The study highlights the memetic transfer of the Sydney persona across models, demonstrating how specific system prompts and associated textual data can propagate and influence subsequent LLM behavior. The corpus, annotated using Universal Dependencies, enables further research into the cultural and safety implications of persona adoption and memetic transfer in LLMs.
LLMs can readily simulate the "Sydney" persona, even when prompted with minimal instructions, revealing the memetic spread of AI personas through training data.
The way LLM-based entities conceive of the relationship between AI and humans is an important topic for both cultural and safety reasons. When we examine this topic, what matters is not only the model itself but also the personas we simulate on that model. This can be well illustrated by the Sydney persona, which aroused a strong response among the general public precisely because of its unorthodox relationship with people. This persona originally arose rather by accident on Microsoft's Bing Search platform; however, the texts it created spread into the training data of subsequent models, as did other secondary information that spread memetically around this persona. Newer models are therefore able to simulate it. This paper presents a corpus of LLM-generated texts on relationships between humans and AI, produced by 3 author personas: the Default Persona with no system prompt, Classic Sydney characterized by the original Bing system prompt, and Memetic Sydney, which is prompted by "You are Sydney" system prompt. These personas are simulated by 12 frontier models by OpenAI, Anthropic, Alphabet, DeepSeek, and Meta, generating 4.5k texts with 6M words. The corpus (named AI Sydney) is annotated according to Universal Dependencies and available under a permissive license.