OPPOJun 7, 2026arXiv:2606.08414

PACT: Self-Evolving Physical Safety Alignment for Diffusion Policies in Embodied Manipulation

Lingxuan Wu, Zijian Zhu, Lizhong Wang, Chengyang Ying, Huayu Chen, Xiao Yang, Fangming Liu, Jun Zhu

AI Summary

This paper introduces PACT, a self-evolving framework that enhances the safety of diffusion policies in robotic manipulation by projecting them onto constraint-feasible regions without relying on demonstration data or task rewards. The method employs a reverse-KL objective with dense supervision to distill constraint gradients into the diffusion model, while a progressive curriculum tightens constraints to ensure policy stability and improvement. Experimental results show that PACT reduces safety violations by 31.0% and increases task success rates by 30.7% across various benchmarks, highlighting its effectiveness in balancing safety and performance.

Key Contribution

PACT reduces safety violations by over 30% while simultaneously boosting task success, reshaping the landscape of safe robotic manipulation.

Abstract

Diffusion policies have achieved remarkable success in robotic manipulation, yet they often fail to satisfy strict physical constraints required for safe deployment. Existing approaches impose safety either prematurely during training or reactively via external guardrails at test time, limiting policy expressivity and overall scalability. We propose Physical safety Alignment for Constrained Trajectories (PACT), a self-evolving post-training framework that projects pretrained diffusion policies onto constraint-feasible regions without accessing demonstration data or task rewards. PACT distills constraint gradients into the diffusion model through a reverse-KL objective with dense supervision across timesteps. It incorporates a curriculum that progressively tightens constraints while maintaining theoretically bounded policy shift and monotone improvement, mitigating the safety-performance trade-off from catastrophic forgetting. On simulated and real-world embodied manipulation benchmarks, PACT significantly reduces safety violations by 31.0% on average while improving task success by 30.7%.

Constitutional AI & AI Ethics Robotics & Embodied AI

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

PACT: Self-Evolving Physical Safety Alignment for Diffusion Policies in Embodied Manipulation

Related Papers