This paper introduces a framework for influencing LLM-based multi-agent dialogue by treating prompts as parameterized actions, yielding a lightweight policy that requires no training. The framework dynamically assembles each prompt from five components based on the agent's current state. Experiments in two discussion scenarios demonstrate that this policy-parameterized prompting can effectively influence dialogue dynamics, as measured by responsiveness, rebuttal, evidence usage, non-repetition, and stance shift.
Forget RLHF – steering LLM multi-agent conversations might be as simple as crafting the right sequence of prompts.
Large Language Models (LLMs) have emerged as a new paradigm for multi-agent systems. However, existing research on the behaviour of LLM-based multi-agent systems relies on ad hoc prompts and lacks a principled policy perspective. In contrast to reinforcement learning approaches, we investigate whether prompts, treated as actions, can be parameterized to construct a lightweight policy, consisting of a sequence of state-action pairs, that influences conversational behaviours without training. Our framework regards prompts as actions executed by LLMs and dynamically constructs each prompt from five components based on the current state of the agent. To test the effectiveness of parameterized control, we evaluate the dialogue flow using five indicators: responsiveness, rebuttal, evidence usage, non-repetition, and stance shift. We conduct experiments using different LLM-driven agents in two discussion scenarios relevant to the general public and show that prompt parameterization can influence the dialogue dynamics. This result shows that policy-parameterized prompts offer a simple and effective mechanism for influencing the dialogue process, which supports research on multi-agent systems for social simulation.
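To make the prompt-as-action idea concrete, the following is a minimal sketch of how a training-free policy might map an agent's state to a prompt assembled from five components. The abstract does not name the five components or define the state, so every name here (`AgentState`, `PromptAction`, `policy`, and the fields `role`, `goal`, `context`, `style`, `constraint`) is a hypothetical stand-in rather than the authors' design.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AgentState:
    # Hypothetical state; the paper's actual state representation is not given.
    turn: int
    stance: float       # assumed scale: -1.0 (against) to 1.0 (for)
    last_message: str   # most recent message from another agent


@dataclass(frozen=True)
class PromptAction:
    # Five illustrative components; the names are assumptions, not the paper's.
    role: str
    goal: str
    context: str
    style: str
    constraint: str

    def render(self) -> str:
        """Join the non-empty components into the prompt sent to the LLM."""
        parts = [self.role, self.goal, self.context, self.style, self.constraint]
        return "\n".join(p for p in parts if p)


def policy(state: AgentState) -> PromptAction:
    """Rule-based mapping from state to action; no training is involved."""
    opening = state.turn == 0
    goal = ("State your position." if opening
            else "Respond directly to the previous speaker.")
    context = "" if opening else f'Previous message: "{state.last_message}"'
    style = ("Cite evidence." if abs(state.stance) > 0.5
             else "Ask a clarifying question.")
    return PromptAction(
        role="You are a participant in a public discussion.",
        goal=goal,
        context=context,
        style=style,
        constraint="Do not repeat earlier points.",
    )


if __name__ == "__main__":
    state = AgentState(turn=2, stance=0.9, last_message="Taxes are too high.")
    print(policy(state).render())
```

In this sketch the policy is a plain function, so changing how agents are steered means editing rules or parameters rather than retraining a model. The rendered string would then be passed to whichever LLM drives the agent.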