This paper introduces LineRides, a reinforcement learning framework that trains a bicycle robot to perform stunts using only a spatial guideline and sparse key-orientations, bypassing the need for demonstrations or explicit timing. The framework employs a tracking margin to handle infeasible guidelines, measures progress via traveled distance along the guideline to resolve temporal ambiguity, and uses position- and sequence-based key-orientations to disambiguate motion details. Experiments on the Ultra Mobility Vehicle (UMV) demonstrate the successful acquisition of five distinct stunts (MiniHop, LargeHop, ThreePointTurn, Backflip, and DriftTurn), with seamless transitions between normal driving and stunt execution.
Forget hand-crafted reward functions: this RL framework lets a bicycle robot learn complex stunts from just a spatial guideline and a few key poses.
Designing reward functions for agile robotic maneuvers in reinforcement learning remains difficult, and demonstration-based approaches often require reference motions that are unavailable for novel platforms or extreme stunts. We present LineRides, a line-guided learning framework that enables a custom bicycle robot to acquire diverse, commandable stunt behaviors from a user-provided spatial guideline and sparse key-orientations, without demonstrations or explicit timing. LineRides handles physically infeasible guidelines using a tracking margin that permits controlled deviation, resolves temporal ambiguity by measuring progress via traveled distance along the guideline, and disambiguates motion details through position- and sequence-based key-orientations. We evaluate LineRides on the Ultra Mobility Vehicle (UMV) and show that the policy trained with our methods supports seamless transitions between normal driving and stunt execution, enabling five distinct stunts on command: MiniHop, LargeHop, ThreePointTurn, Backflip, and DriftTurn.
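To make the tracking margin and distance-based progress concrete, here is a minimal Python sketch. It is not the authors' implementation: the abstract gives no formulas, so the polyline representation, the nearest-point projection, the quadratic penalty, and every name and default value (`project_to_guideline`, `guideline_reward`, `margin`, `w_progress`, `w_track`) are assumptions made for illustration. Key-orientations are left out.

```python
import numpy as np


def project_to_guideline(points, pos):
    """Project pos onto a polyline guideline (hypothetical helper).

    Returns (arc_length, distance): the distance traveled along the
    guideline up to the closest point, and the deviation from it.
    """
    points = np.asarray(points, dtype=float)
    pos = np.asarray(pos, dtype=float)
    starts, ends = points[:-1], points[1:]
    seg = ends - starts
    seg_len = np.linalg.norm(seg, axis=1)
    # Parameter of the closest point on each segment, clamped to [0, 1].
    t = np.einsum("ij,ij->i", pos - starts, seg) / np.maximum(seg_len**2, 1e-12)
    t = np.clip(t, 0.0, 1.0)
    closest = starts + t[:, None] * seg
    dist = np.linalg.norm(closest - pos, axis=1)
    i = int(np.argmin(dist))
    # Arc length at the start of each segment.
    cum = np.concatenate([[0.0], np.cumsum(seg_len)])
    return float(cum[i] + t[i] * seg_len[i]), float(dist[i])


def guideline_reward(points, prev_pos, pos, margin=0.2,
                     w_progress=1.0, w_track=1.0):
    """Assumed reward: progress along the line minus out-of-margin penalty."""
    s_prev, _ = project_to_guideline(points, prev_pos)
    s_now, d = project_to_guideline(points, pos)
    progress = s_now - s_prev          # no timing: only distance covered counts
    excess = max(0.0, d - margin)      # deviation inside the margin is free
    return w_progress * progress - w_track * excess**2


# Usage: an L-shaped guideline and one step that stays within the margin.
guideline = [[0.0, 0.0], [1.0, 0.0], [1.0, 1.0]]
reward = guideline_reward(guideline, prev_pos=[0.5, 0.1], pos=[1.1, 0.5])
```

In this sketch, any deviation smaller than `margin` costs nothing, which is one way a policy could depart from a physically infeasible guideline without being penalized. A nearest-point projection would be ambiguous for guidelines that cross themselves; it is used here only to keep the example short.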