Mar 8, 2026arXiv:2603.07618

SMAT: Staged Multi-Agent Training for Co-Adaptive Exoskeleton Control

Yifei Yuan, Ghaith Androwis, Xianlian Zhou

AI Summary

The paper introduces Staged Multi-Agent Training (SMAT), a four-stage curriculum designed to address the non-stationary learning problem of co-adaptive exoskeleton control by mirroring the natural human acclimation process. SMAT trains a musculoskeletal human actor and a bilateral hip exoskeleton actor progressively, first with the human learning unassisted gait and adapting to device mass, then the exoskeleton learning assistance against a stabilized human, and finally both co-adapting. Simulations show a 10.1% reduction in hip muscle activation, and physical experiments with five subjects demonstrate consistent positive mechanical power delivery (13.6 W to 23.8 W) without subject-specific retraining.

Key Contribution

Exoskeletons can learn to assist human movement without explicit timing cues or subject-specific retraining, thanks to a staged multi-agent training approach that mimics natural human adaptation.

Abstract

Effective exoskeleton assistance requires co-adaptation: as the device alters joint dynamics, the user reorganizes neuromuscular coordination, creating a non-stationary learning problem. Most learning-based approaches do not explicitly account for the sequential nature of human motor adaptation, leading to training instability and poorly timed assistance. We propose Staged Multi-Agent Training (SMAT), a four-stage curriculum designed to mirror how users naturally acclimate to a wearable device. In SMAT, a musculoskeletal human actor and a bilateral hip exoskeleton actor are trained progressively: the human first learns unassisted gait, then adapts to the added device mass; the exoskeleton subsequently learns a positive assistance pattern against a stabilized human policy, and finally both agents co-adapt with full torque capacity and bidirectional feedback. We implement SMAT in the MyoAssist simulation environment using a 26-muscle lower-limb model and an attached hip exoskeleton. Our musculoskeletal simulations demonstrate that the learned exoskeleton control policy produces an average 10.1% reduction in hip muscle activation relative to the no-assist condition. We validated the learned controller in an offline setting using open-source gait data, then deployed it to a physical hip exoskeleton for treadmill experiments with five subjects. The resulting policy delivers consistent assistance and predominantly positive mechanical power without the need for any explicitly imposed timing shift (mean positive power: 13.6 W at 6 Nm RMS torque to 23.8 W at 9.3 Nm RMS torque, with minimal negative power) consistently across all subjects without subject-specific retraining.

Robotics & Embodied AI Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

SMAT: Staged Multi-Agent Training for Co-Adaptive Exoskeleton Control

Related Papers