Search papers, labs, and topics across Lattice.
CycleRL, a sim-to-real deep reinforcement learning framework, is introduced for autonomous bicycle control, training a neural policy in NVIDIA Isaac Sim using Proximal Policy Optimization (PPO). Systematic domain randomization bridges the sim-to-real gap, enabling direct transfer without an explicit dynamics model. CycleRL achieves high performance in simulation (99.90% balance success, low tracking errors) and successful hardware transfer, demonstrating DRL's effectiveness for adaptable bicycle control.
Forget hand-tuned controllers: a deep RL policy trained in simulation can directly control a real bicycle, achieving impressive balance and tracking performance.
Autonomous bicycles offer a promising agile solution for urban mobility and last-mile logistics, however, conventional control strategies often struggle with their underactuated nonlinear dynamics, suffering from sensitivity to model mismatches and limited adaptability to real-world uncertainties. To address this, this paper presents CycleRL, the first sim-to-real deep reinforcement learning framework designed for robust autonomous bicycle control. Our approach trains an end-to-end neural control policy within the high-fidelity NVIDIA Isaac Sim environment, leveraging Proximal Policy Optimization (PPO) to circumvent the need for an explicit dynamics model. The framework features a composite reward function tailored for concurrent balance maintenance, velocity tracking, and steering control. Crucially, systematic domain randomization is employed to bridge the simulation-to-reality gap and facilitate direct transfer. In simulation, CycleRL achieves considerable performance, including a 99.90% balance success rate, a low steering tracking error of 1.15°, and a velocity tracking error of 0.18 m/s. These quantitative results, coupled with successful hardware transfer, validate DRL as an effective paradigm for autonomous bicycle control, offering superior adaptability over traditional methods. Video demonstrations are available at https://anony6f05.github.io/CycleRL/.