The paper introduces Persona Dynamic Decoding (PDD), a novel framework for enhancing persona adherence in role-playing language agents by dynamically estimating the context-dependent importance of persona attributes. PDD leverages a Persona Importance Estimation (PIE) module to quantify the contextual relevance of persona attributes without ground-truth supervision, and a Persona-Guided Inference-Time Alignment (PIA) paradigm to modulate generation probabilities during inference based on weighted multi-objective rewards. Experiments demonstrate that PDD improves utterance consistency and behavioral fidelity compared to existing static persona management strategies.
Forget static prompts: this method dynamically adjusts persona influence during decoding, boosting role-playing agent realism without costly fine-tuning.
The utility of Role-Playing Language Agents in sociological research is growing alongside the adoption of Large Language Models. For realism in social simulation, these agents must adhere to the personas defined by their character profiles, yet existing strategies (static prompt engineering or costly fine-tuning) fail to adapt personas to dynamic scenarios. Psychological theories, such as the Cognitive-Affective Personality System (CAPS), provide a crucial explanation for this failure: a persona's influence on behavior is not static but varies with the scenario. This context-dependence highlights the critical need for adaptive persona management. To address this gap, we propose a novel, theory-driven method that dynamically estimates context-dependent persona importance and integrates it into weighted reward-guided decoding, enabling inference-time persona following. Specifically, we introduce the Persona Dynamic Decoding (PDD) framework, which consists of two key components: (1) a Persona Importance Estimation (PIE) module, which dynamically quantifies the contextual importance of persona attributes without requiring ground-truth supervision; and (2) a Persona-Guided Inference-Time Alignment (PIA) paradigm, which leverages these importance scores to construct weighted multi-objective rewards and modulate generation probabilities during inference. Extensive experiments show the effectiveness of our method in utterance consistency and behavioral fidelity.
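The abstract does not give the exact formulation, so the following is only a minimal sketch of what importance-weighted, reward-guided decoding could look like for a single decoding step. It assumes a common additive form in which each candidate token's logit is shifted by a weighted sum of per-attribute rewards. The function names (`softmax`, `reweight_logits`), the scaling parameter `beta`, and the toy reward values are all hypothetical and are not taken from the paper.

```python
import math


def softmax(logits):
    """Convert logits to probabilities in a numerically stable way."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def reweight_logits(base_logits, attribute_rewards, importance, beta=1.0):
    """Shift candidate logits by an importance-weighted sum of rewards.

    base_logits:       one logit per candidate token (from the base model).
    attribute_rewards: one list per persona attribute, each holding one
                       reward per candidate token (hypothetical scores).
    importance:        one non-negative weight per persona attribute,
                       standing in for context-dependent importance.
    beta:              assumed strength of the reward guidance.
    """
    total_weight = sum(importance)
    if total_weight <= 0:
        raise ValueError("importance weights must sum to a positive value")
    # Normalize so the weights form a convex combination of the rewards.
    weights = [w / total_weight for w in importance]

    adjusted = []
    for j, logit in enumerate(base_logits):
        reward = sum(w * r[j] for w, r in zip(weights, attribute_rewards))
        adjusted.append(logit + beta * reward)
    return adjusted


# Toy example: two candidate tokens, two persona attributes.
base = [0.0, 0.0]                 # base model is indifferent
rewards = [[1.0, 0.0],            # attribute A favors token 0
           [0.0, 1.0]]            # attribute B favors token 1
weights = [3.0, 1.0]              # attribute A matters more in this context

adjusted = reweight_logits(base, rewards, weights, beta=2.0)
probs = softmax(adjusted)
```

In this toy setting the token favored by the more important attribute ends up with the higher probability, and setting `beta=0` recovers the base model's logits. How the importance weights and rewards are actually computed is the job of the PIE and PIA components, which this sketch does not attempt to reproduce.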