Search papers, labs, and topics across Lattice.
This paper introduces "agentic microphysics" as a methodology for analyzing AI safety risks arising from multi-agent interactions, focusing on local interaction dynamics where agents influence each other. It argues that current safety approaches are insufficient because they don't capture interaction-level mechanisms. The authors propose "generative safety," a methodology for growing phenomena from micro-level conditions to identify risks and design interventions.
Current AI safety evaluations miss the forest for the trees: population-level risks emerge from agent interactions, not isolated model behaviors.
This paper advances a methodological proposal for safety research in agentic AI. As systems acquire planning, memory, tool use, persistent identity, and sustained interaction, safety can no longer be analysed primarily at the level of the isolated model. Population-level risks arise from structured interaction among agents, through processes of communication, observation, and mutual influence that shape collective behaviour over time. As the object of analysis shifts, a methodological gap emerges. Approaches focused either on single agents or on aggregate outcomes do not identify the interaction-level mechanisms that generate collective risks or the design variables that control them. A framework is required that links local interaction structure to population-level dynamics in a causally explicit way, allowing both explanation and intervention. We introduce two linked concepts. Agentic microphysics defines the level of analysis: local interaction dynamics where one agent's output becomes another's input under specific protocol conditions. Generative safety defines the methodology: growing phenomena and elicit risks from micro-level conditions to identify sufficient mechanisms, detect thresholds, and design effective interventions.