Search papers, labs, and topics across Lattice.
This paper introduces a latent-state POMDP framework for a robot to strategically shape a human's prosociality during repeated interactions by inferring and influencing their latent prosocial state through actions like helping and signaling. The model learns the transition and observation dynamics using expectation maximization to balance task objectives with social influence. Experiments using user study data demonstrate that the learned policy outperforms baseline strategies, leading to improved team performance and increased human cooperative behavior.
Robots can learn to strategically nudge humans toward prosocial behavior, boosting cooperation and team performance.
We propose a decision-theoretic framework in which a robot strategically can shape inferred human's prosocial state during repeated interactions. Modeling the human's prosociality as a latent state that evolves over time, the robot learns to infer and influence this state through its own actions, including helping and signaling. We formalize this as a latent-state POMDP with limited observations and learn the transition and observation dynamics using expectation maximization. The resulting belief-based policy balances task and social objectives, selecting actions that maximize long-term cooperative outcomes. We evaluate the model using data from user studies and show that the learned policy outperforms baseline strategies in both team performance and increasing observed human cooperative behavior.