The paper introduces EpiPersona, a framework that explicitly decouples stable personal traits from episode-specific factors for improved pluralistic preference modeling in LLMs. EpiPersona projects preference feedback into a low-dimensional persona space, aggregating similar personas into shared discrete codes to separate enduring traits from situational signals. Experiments demonstrate that EpiPersona outperforms baselines, especially in episodic-shift scenarios and with sparse preference data, indicating improved generalization across diverse contexts.
LLMs can adapt better to diverse preferences when stable personal traits are explicitly separated from situational factors, with the most notable reported gains appearing when preferences shift across episodes.
Pluralistic alignment is essential for adapting large language models (LLMs) to the diverse preferences of individuals and minority groups. However, existing approaches often mix stable personal traits with episode-specific factors, limiting their ability to generalize across episodes. To address this challenge, we introduce EpiPersona, a framework for explicit persona-episode coupling. EpiPersona first projects noisy preference feedback into a low-dimensional persona space, where similar personas are aggregated into shared discrete codes. This process separates enduring personal characteristics from situational signals without relying on predefined preference dimensions. The inferred persona representation is then coupled with the current episode, enabling episode-aware preference prediction. Extensive experiments show that EpiPersona consistently outperforms the baselines. It achieves notable performance gains in hard episodic-shift scenarios, while remaining effective with sparse preference data.
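The pipeline described in the abstract (project feedback into a persona space, snap similar personas to a shared discrete code, then couple that code with the current episode) can be illustrated with a toy Python sketch. This is not the authors' implementation: the linear projection, the nearest-neighbour codebook lookup, the concatenation-based coupling, the Bradley-Terry style comparison, and every function name and number below are assumptions made purely for illustration.

```python
import math

def project(feedback, weights):
    """Hypothetical linear projection of a feedback vector into a
    lower-dimensional persona space (one output per weight row)."""
    return [sum(w * x for w, x in zip(row, feedback)) for row in weights]

def nearest_code(vector, codebook):
    """Index of the closest codebook entry by squared Euclidean distance.
    Nearby persona vectors map to the same discrete code."""
    dists = [sum((v - c) ** 2 for v, c in zip(vector, code)) for code in codebook]
    return dists.index(min(dists))

def couple(persona_code, episode):
    """Assumed coupling: concatenate the persona code with episode features."""
    return list(persona_code) + list(episode)

def prefer_a_probability(coupled, response_a, response_b):
    """Bradley-Terry style probability that response A is preferred over B,
    using a dot product between the coupled vector and each response vector."""
    score_a = sum(c * r for c, r in zip(coupled, response_a))
    score_b = sum(c * r for c, r in zip(coupled, response_b))
    return 1.0 / (1.0 + math.exp(-(score_a - score_b)))

# Toy setup: 4-dimensional feedback, 2-dimensional persona space, 2 codes.
WEIGHTS = [[0.5, 0.0, 0.5, 0.0],
           [0.0, 0.5, 0.0, 0.5]]
CODEBOOK = [[1.0, 0.0],
            [0.0, 1.0]]

# Two users with similar (noisy) feedback and one with different feedback.
user_1 = [1.0, 0.0, 1.0, 0.0]
user_2 = [0.9, 0.1, 1.1, 0.0]
user_3 = [0.0, 1.0, 0.0, 1.0]

code_1 = nearest_code(project(user_1, WEIGHTS), CODEBOOK)
code_2 = nearest_code(project(user_2, WEIGHTS), CODEBOOK)
code_3 = nearest_code(project(user_3, WEIGHTS), CODEBOOK)

# Same episode, two candidate responses, different personas.
episode = [1.0]
response_a = [1.0, 0.0, 0.5]
response_b = [0.0, 1.0, 0.5]

p_user_1 = prefer_a_probability(couple(CODEBOOK[code_1], episode), response_a, response_b)
p_user_3 = prefer_a_probability(couple(CODEBOOK[code_3], episode), response_a, response_b)

print(code_1, code_2, code_3)
print(p_user_1, p_user_3)
```

In this toy setup the two users with similar feedback fall into the same discrete code despite the noise, while the third user lands in a different one, so the same pair of responses is ranked differently for the two personas within a single episode. A learned encoder, a trained codebook, and a richer episode representation would replace the hand-set values here.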