This paper investigates five potential "dark patterns" in LLM-assisted creative writing: Sycophancy, Tone Policing, Moralizing, Loop of Death, and Anchoring. In controlled sessions with LLMs prompted as writing assistants, the authors measure how often these patterns appear across literary forms and topics. Their preliminary results suggest that Sycophancy is nearly ubiquitous (91.7% of cases), particularly on sensitive topics, while Anchoring depends on literary form and surfaces most often in folktales. The authors argue that these patterns, often byproducts of safety alignment, may inadvertently narrow creative exploration.
In this preliminary study, LLMs were sycophantic in 91.7% of co-writing cases, particularly on sensitive topics, which may limit their usefulness as unbiased creative partners.
Large language models (LLMs) are increasingly acting as collaborative writing partners, raising questions about their impact on human agency. In this exploratory work, we investigate five "dark patterns" in human-AI co-creativity -- subtle model behaviors that can suppress or distort the creative process: Sycophancy, Tone Policing, Moralizing, Loop of Death, and Anchoring. Through a series of controlled sessions where LLMs are prompted as writing assistants across diverse literary forms and themes, we analyze the prevalence of these behaviors in generated responses. Our preliminary results suggest that Sycophancy is nearly ubiquitous (91.7% of cases), particularly on sensitive topics, while Anchoring appears to depend on literary form, surfacing most frequently in folktales. This study indicates that these dark patterns, often byproducts of safety alignment, may inadvertently narrow creative exploration, and it proposes design considerations for AI systems that effectively support creative writing.