Search papers, labs, and topics across Lattice.
The paper introduces Staged Multi-Agent Training (SMAT), a four-stage curriculum designed to address the non-stationary learning problem of co-adaptive exoskeleton control by mirroring the natural human acclimation process. SMAT trains a musculoskeletal human actor and a bilateral hip exoskeleton actor progressively, first with the human learning unassisted gait and adapting to device mass, then the exoskeleton learning assistance against a stabilized human, and finally both co-adapting. Simulations show a 10.1% reduction in hip muscle activation, and physical experiments with five subjects demonstrate consistent positive mechanical power delivery (13.6 W to 23.8 W) without subject-specific retraining.
Exoskeletons can learn to assist human movement without explicit timing cues or subject-specific retraining, thanks to a staged multi-agent training approach that mimics natural human adaptation.
Effective exoskeleton assistance requires co-adaptation: as the device alters joint dynamics, the user reorganizes neuromuscular coordination, creating a non-stationary learning problem. Most learning-based approaches do not explicitly account for the sequential nature of human motor adaptation, leading to training instability and poorly timed assistance. We propose Staged Multi-Agent Training (SMAT), a four-stage curriculum designed to mirror how users naturally acclimate to a wearable device. In SMAT, a musculoskeletal human actor and a bilateral hip exoskeleton actor are trained progressively: the human first learns unassisted gait, then adapts to the added device mass; the exoskeleton subsequently learns a positive assistance pattern against a stabilized human policy, and finally both agents co-adapt with full torque capacity and bidirectional feedback. We implement SMAT in the MyoAssist simulation environment using a 26-muscle lower-limb model and an attached hip exoskeleton. Our musculoskeletal simulations demonstrate that the learned exoskeleton control policy produces an average 10.1% reduction in hip muscle activation relative to the no-assist condition. We validated the learned controller in an offline setting using open-source gait data, then deployed it to a physical hip exoskeleton for treadmill experiments with five subjects. The resulting policy delivers consistent assistance and predominantly positive mechanical power without the need for any explicitly imposed timing shift (mean positive power: 13.6 W at 6 Nm RMS torque to 23.8 W at 9.3 Nm RMS torque, with minimal negative power) consistently across all subjects without subject-specific retraining.