May 6, 2026 · arXiv:2605.05110

LineRides: Line-Guided Reinforcement Learning for Bicycle Robot Stunts

Seungeun Rho, Shamel Fahmi, Jeonghwan Kim, Arianna Ilvonen, Sehoon Ha, Gabriel Nelson

AI Summary

This paper introduces LineRides, a reinforcement learning framework that trains a bicycle robot to perform stunts using only a spatial guideline and sparse key-orientations, bypassing the need for demonstrations or explicit timing. The framework employs a tracking margin to handle infeasible guidelines, measures progress via traveled distance to resolve temporal ambiguity, and uses position- and sequence-based key-orientations to disambiguate motion details. Experiments on the Ultra Mobility Vehicle (UMV) demonstrate the successful acquisition of five distinct stunt behaviors (MiniHop, LargeHop, ThreePointTurn, Backflip, and DriftTurn), with seamless transitions between normal driving and stunt execution.

Key Contribution

Instead of hand-crafted reward functions or demonstrations, this RL framework lets a bicycle robot learn complex stunts from just a spatial guideline and a few key-orientations.

Abstract

Designing reward functions for agile robotic maneuvers in reinforcement learning remains difficult, and demonstration-based approaches often require reference motions that are unavailable for novel platforms or extreme stunts. We present LineRides, a line-guided learning framework that enables a custom bicycle robot to acquire diverse, commandable stunt behaviors from a user-provided spatial guideline and sparse key-orientations, without demonstrations or explicit timing. LineRides handles physically infeasible guidelines using a tracking margin that permits controlled deviation, resolves temporal ambiguity by measuring progress via traveled distance along the guideline, and disambiguates motion details through position- and sequence-based key-orientations. We evaluate LineRides on the Ultra Mobility Vehicle (UMV) and show that the policy trained with our methods supports seamless transitions between normal driving and stunt execution, enabling five distinct stunts on command: MiniHop, LargeHop, ThreePointTurn, Backflip, and DriftTurn.
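The two geometric ideas in the abstract, progress measured as traveled distance along the guideline and a tracking margin that tolerates bounded deviation, can be illustrated with a small sketch. This is not the paper's reward formulation: the polyline representation of the guideline, the function names `project_onto_guideline` and `line_guided_reward`, the linear penalty outside the margin, and the default `margin` value are all assumptions made for illustration. Key-orientations are left out.

```python
import math

def project_onto_guideline(point, waypoints):
    """Project `point` onto a polyline guideline (hypothetical helper).

    Returns (s, d): the arc length along the guideline at the closest
    point, and the Euclidean deviation of `point` from the guideline.
    """
    best_s, best_d = 0.0, math.inf
    travelled = 0.0  # arc length accumulated over earlier segments
    for a, b in zip(waypoints[:-1], waypoints[1:]):
        seg = [bj - aj for aj, bj in zip(a, b)]
        rel = [pj - aj for aj, pj in zip(a, point)]
        seg_len_sq = sum(c * c for c in seg)
        seg_len = math.sqrt(seg_len_sq)
        # Fraction along the segment of the closest point, clamped to [0, 1].
        t = 0.0 if seg_len_sq == 0.0 else sum(r * c for r, c in zip(rel, seg)) / seg_len_sq
        t = min(1.0, max(0.0, t))
        closest = [aj + t * c for aj, c in zip(a, seg)]
        d = math.dist(point, closest)
        if d < best_d:
            best_s, best_d = travelled + t * seg_len, d
        travelled += seg_len
    return best_s, best_d

def line_guided_reward(prev_point, point, waypoints, margin=0.3):
    """Assumed reward shape: progress along the line minus excess deviation.

    Progress depends only on distance traveled along the guideline, so no
    timing is specified. Deviation inside `margin` is free; only the
    excess beyond it is penalized (linearly, as an assumption).
    """
    s_prev, _ = project_onto_guideline(prev_point, waypoints)
    s_now, deviation = project_onto_guideline(point, waypoints)
    progress = s_now - s_prev
    penalty = max(0.0, deviation - margin)
    return progress - penalty
```

For example, with an L-shaped guideline `[(0, 0, 0), (1, 0, 0), (1, 1, 0)]`, moving from `(0.2, 0, 0)` to `(0.5, 0.2, 0)` advances 0.3 along the line; the 0.2 deviation is free under the default margin but costs 0.1 when `margin=0.1`. Because the reward never references time, the policy is free to choose how fast to traverse the guideline.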

Robotics & Embodied AI · Training Efficiency & Optimization · World Models & Planning

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
