Mar 2, 2026arXiv:2603.02379

Strategic Shaping of Human Prosociality: A Latent-State POMDP Framework

Zahra Zahedi, Xinyue Hu, Shashank Mehrotra, Mark Steyvers, Kumar Akash

AI Summary

This paper introduces a latent-state POMDP framework for a robot to strategically shape a human's prosociality during repeated interactions by inferring and influencing their latent prosocial state through actions like helping and signaling. The model learns the transition and observation dynamics using expectation maximization to balance task objectives with social influence. Experiments using user study data demonstrate that the learned policy outperforms baseline strategies, leading to improved team performance and increased human cooperative behavior.

Key Contribution

Robots can learn to strategically nudge humans toward prosocial behavior, boosting cooperation and team performance.

Abstract

We propose a decision-theoretic framework in which a robot strategically can shape inferred human's prosocial state during repeated interactions. Modeling the human's prosociality as a latent state that evolves over time, the robot learns to infer and influence this state through its own actions, including helping and signaling. We formalize this as a latent-state POMDP with limited observations and learn the transition and observation dynamics using expectation maximization. The resulting belief-based policy balances task and social objectives, selecting actions that maximize long-term cooperative outcomes. We evaluate the model using data from user studies and show that the learned policy outperforms baseline strategies in both team performance and increasing observed human cooperative behavior.

Constitutional AI & AI Ethics RLHF & Preference Learning Robotics & Embodied AI

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Strategic Shaping of Human Prosociality: A Latent-State POMDP Framework

Related Papers