The paper introduces the Articulated-Body Dynamics Network (ABD-Net), a graph neural network architecture that incorporates forward dynamics as an inductive bias for robot policy learning. ABD-Net adapts the inertia propagation mechanism from the Articulated Body Algorithm, using learnable parameters to aggregate inertial quantities across a robot's kinematic tree. Experiments on simulated and real robots (humanoid, quadruped, hopper) demonstrate improved sample efficiency, generalization to dynamics shifts, and successful sim-to-real transfer compared to transformer and GNN baselines.
Encoding dynamics directly into a robot policy's neural network architecture lets the policy learn faster and generalize better than standard transformer and GNN baselines.
Recent work in reinforcement learning has shown that incorporating structural priors for articulated robots, such as link connectivity, into policy networks improves learning efficiency. However, dynamics properties, despite their fundamental role in determining how forces and motion propagate through the body, remain largely underexplored as an inductive bias for policy learning. To address this gap, we present the Articulated-Body Dynamics Network (ABD-Net), a novel graph neural network architecture grounded in the computational structure of forward dynamics. Specifically, we adapt the inertia propagation mechanism from the Articulated Body Algorithm, systematically aggregating inertial quantities from child to parent links in a tree-structured manner, while replacing physical quantities with learnable parameters. Embedding ABD-Net into the policy actor enables dynamics-informed representations that capture how actions propagate through the body, leading to efficient and robust policy learning. Through experiments with simulated humanoid, quadruped, and hopper robots, our approach demonstrates increased sample efficiency and generalization to dynamics shifts compared to transformer-based and GNN baselines. We further validate the learned policy on real Unitree G1 and Go2 robots, state-of-the-art humanoid and quadruped platforms, generating dynamic, versatile, and robust locomotion behaviors through sim-to-real transfer with real-time inference.
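The child-to-parent aggregation described in the abstract can be illustrated with a small sketch. This is not the authors' implementation. It only mimics the shape of the backward (leaf-to-root) pass of the Articulated Body Algorithm, with learnable matrices standing in for physical quantities. The names `make_params` and `propagate_inertia`, the `parents` array encoding, the feature dimension, and the choice of a symmetric positive definite parameterization are all assumptions made for illustration. The real algorithm operates on 6x6 spatial inertias and includes a joint-axis projection step that is omitted here.

```python
import numpy as np


def make_params(num_links, dim, seed=0):
    """Create hypothetical learnable parameters for each link.

    Returns per-link "inertia-like" matrices and per-link transforms.
    In a real model these would be trained, not just sampled.
    """
    rng = np.random.default_rng(seed)
    factors = rng.normal(scale=0.1, size=(num_links, dim, dim))
    # L @ L.T is symmetric positive semidefinite; the small diagonal
    # term makes it positive definite, like a physical inertia matrix.
    local = factors @ np.swapaxes(factors, 1, 2) + 1e-3 * np.eye(dim)
    # Transforms start near the identity (assumption for illustration).
    transforms = np.eye(dim) + rng.normal(scale=0.1, size=(num_links, dim, dim))
    return local, transforms


def propagate_inertia(parents, local, transforms):
    """Aggregate inertia-like matrices from child links to parent links.

    parents[i] is the parent index of link i, or -1 for the root.
    Links are assumed to be numbered so that parents[i] < i, which
    lets a single reverse sweep visit every child before its parent.
    """
    agg = np.array(local, dtype=float)  # copy so the inputs stay untouched
    for i in range(len(parents) - 1, -1, -1):
        p = parents[i]
        if p < 0:
            continue  # the root has no parent to pass its inertia to
        T = transforms[i]
        # Congruence transform, analogous to expressing a child's
        # inertia in the parent's frame before summing.
        agg[p] += T.T @ agg[i] @ T
    return agg


# Toy kinematic tree: link 0 is the root, links 1 and 2 attach to it,
# and link 3 attaches to link 1.
parents = [-1, 0, 0, 1]
local, transforms = make_params(num_links=4, dim=3)
aggregated = propagate_inertia(parents, local, transforms)
print(aggregated.shape)
```

Because each contribution has the form `T.T @ M @ T`, the aggregated matrices stay symmetric and positive definite, which is one plausible way to keep learned quantities inertia-like. How the aggregated quantities feed into the policy actor is not specified in the abstract and is left out of this sketch.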