This paper introduces Iterative Motion Imitation (IMI), a reinforcement learning method for training policies that imitate infeasible reference motions, enabling agile behaviors in robots. Starting from an initial, imperfect reference, IMI iteratively refines trajectories by imitating rollouts from prior policies. The method is demonstrated on the Ultra-Mobility Vehicle (UMV), a bicycle robot: from a self-colliding reference trajectory, it trains policies that perform ground-to-ground and ground-to-table front-flips, with higher success rates than single-shot imitation and robust transfer to the real world.
Bicycle robots can now do front-flips, thanks to a reinforcement learning method that bootstraps from dynamically infeasible reference motions.
This work demonstrates a front-flip on bicycle robots via reinforcement learning, by imitating reference motions that are infeasible and imperfect. To make learning from such references possible, we propose Iterative Motion Imitation (IMI), a method that iteratively imitates trajectories generated by prior policy rollouts. Starting from an initial reference that is kinematically or dynamically infeasible, IMI helps train policies that lead to feasible and agile behaviors. We demonstrate our method on the Ultra-Mobility Vehicle (UMV), a bicycle robot designed to enable agile behaviors. From a self-colliding table-to-ground flip reference generated by a model-based controller, we are able to train policies that enable ground-to-ground and ground-to-table front-flips. We show that, compared to single-shot motion imitation, IMI results in policies that have higher success rates and transfer robustly to the real world. To our knowledge, this is the first unassisted acrobatic flip behavior on such a platform.
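The iterative structure described in the abstract can be illustrated with a toy sketch. This is not the paper's implementation: the abstract does not specify the reward, simulator, or training procedure, so everything below is an assumption made for illustration. Here `train_policy` is a hypothetical stand-in for RL training (it simply returns a tracker of the reference), `rollout` simulates a 1-D system with a hypothetical step limit `MAX_STEP`, and the "infeasible" reference is one that demands steps larger than that limit.

```python
from typing import Callable, List, Tuple

Trajectory = List[float]
Policy = Callable[[float, int], float]  # (state, timestep) -> commanded step

MAX_STEP = 1.0  # hypothetical actuator limit of the toy system


def train_policy(reference: Trajectory) -> Policy:
    """Toy stand-in for RL training: return a policy that tracks `reference`."""
    def policy(state: float, t: int) -> float:
        # Command the step that would reach the next reference state.
        return reference[t + 1] - state
    return policy


def rollout(policy: Policy, start: float, horizon: int) -> Trajectory:
    """Roll the policy out on the toy system, which clips every step."""
    states = [start]
    for t in range(horizon - 1):
        step = max(-MAX_STEP, min(MAX_STEP, policy(states[-1], t)))
        states.append(states[-1] + step)
    return states


def iterative_motion_imitation(
    initial_reference: Trajectory, num_iterations: int
) -> Tuple[Policy, Trajectory]:
    """Repeatedly imitate the rollout produced by the previous policy."""
    reference = list(initial_reference)
    policy = train_policy(reference)
    for _ in range(num_iterations):
        policy = train_policy(reference)
        # The rollout is feasible by construction and becomes the next reference.
        reference = rollout(policy, reference[0], len(reference))
    return policy, reference


# The initial reference jumps by 3.0 per step, which the toy system cannot do.
infeasible_reference = [0.0, 3.0, 3.0, 0.0]
policy, refined = iterative_motion_imitation(infeasible_reference, num_iterations=3)
print(refined)
```

In this toy setting the refined reference respects the step limit after the first iteration and then stays fixed. The real method trains a new policy at each iteration, so later iterations can keep changing the reference rather than converging immediately.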