Mar 16, 2026 · arXiv:2603.15388

Efficient Morphology-Control Co-Design via Stackelberg Proximal Policy Optimization

Yanning Dai, Yuhui Wang, Dylan R. Ashley, Jürgen Schmidhuber

AI Summary

The paper addresses morphology-control co-design, in which an agent's body structure and control policy are optimized jointly. The authors introduce Stackelberg Proximal Policy Optimization (Stackelberg PPO), which models the coupling between morphology and control as a Stackelberg game and explicitly incorporates the control's adaptation dynamics into morphology optimization. Experiments on a range of co-design tasks show that, by aligning morphology updates with control adaptation, Stackelberg PPO is more stable and reaches higher final performance than standard PPO.

Key Contribution

By explicitly modeling the interplay between morphology and control, Stackelberg PPO stabilizes training and boosts performance in morphology-control co-design tasks.

Abstract

Morphology-control co-design concerns the coupled optimization of an agent's body structure and control policy. This problem exhibits a bi-level structure, where the control dynamically adapts to the morphology to maximize performance. Existing methods typically neglect the control's adaptation dynamics by adopting a single-level formulation that treats the control policy as fixed when optimizing morphology. This can lead to inefficient optimization, as morphology updates may be misaligned with control adaptation. In this paper, we revisit the co-design problem from a game-theoretic perspective, modeling the intrinsic coupling between morphology and control as a novel variant of a Stackelberg game. We propose Stackelberg Proximal Policy Optimization (Stackelberg PPO), which explicitly incorporates the control's adaptation dynamics into morphology optimization. By modeling this intrinsic coupling, our method aligns morphology updates with control adaptation, thereby stabilizing training and improving learning efficiency. Experiments across diverse co-design tasks demonstrate that Stackelberg PPO outperforms standard PPO in both stability and final performance, opening the way for dramatically more efficient robotics designs.
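The misalignment described in the abstract can be illustrated with a toy problem. The sketch below is not the paper's algorithm: the quadratic reward, the scalar "morphology" `m` and "control" `c`, and the one-step lookahead are all assumptions made for illustration. It compares a single-level morphology gradient, which holds the control fixed, with a gradient that differentiates through one gradient-ascent step of the control.

```python
# Toy illustration only; every function and constant here is hypothetical
# and is not taken from the paper.

def reward(m, c):
    """Shared toy reward, maximized at m = 1 with the control at c = 2 * m."""
    return -(m - 1.0) ** 2 - (c - 2.0 * m) ** 2


def d_reward_dm(m, c):
    """Partial derivative of the reward with respect to the morphology m."""
    return -2.0 * (m - 1.0) + 4.0 * (c - 2.0 * m)


def d_reward_dc(m, c):
    """Partial derivative of the reward with respect to the control c."""
    return -2.0 * (c - 2.0 * m)


def follower_step(m, c, lr_c):
    """One gradient-ascent step of the control for a fixed morphology."""
    return c + lr_c * d_reward_dc(m, c)


def single_level_grad(m, c):
    """Morphology gradient that treats the current control as fixed."""
    return d_reward_dm(m, c)


def lookahead_grad(m, c, lr_c):
    """Morphology gradient that accounts for the control's next update.

    Differentiates reward(m, follower_step(m, c, lr_c)) with respect to m
    using the chain rule. For this reward, d(follower_step)/dm = 4 * lr_c.
    """
    c_next = follower_step(m, c, lr_c)
    dc_next_dm = 4.0 * lr_c
    return d_reward_dm(m, c_next) + d_reward_dc(m, c_next) * dc_next_dm


if __name__ == "__main__":
    m, c, lr_c = 0.5, 0.0, 0.5  # the control lags behind its best response c = 1
    print(single_level_grad(m, c))     # -3.0: pushes m away from the optimum at 1
    print(lookahead_grad(m, c, lr_c))  # 1.0: pushes m toward the optimum at 1
```

In this toy setting the fixed-control gradient has the wrong sign because the control has not yet adapted to the current morphology, while the lookahead gradient points toward the joint optimum. How Stackelberg PPO actually models the control's adaptation is specified in the paper itself, not here.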

Robotics & Embodied AI · Training Efficiency & Optimization

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
