Search papers, labs, and topics across Lattice.
DiffRacing, a novel vector field-augmented differentiable policy learning framework, is introduced to address the challenges of autonomous drone racing, where gate traversal is difficult to express as smooth, differentiable losses. The framework integrates differentiable losses and vector fields to provide continuous gradient signals, balancing obstacle avoidance and high-speed gate traversal. Experimental results in both simulation and real-world settings demonstrate superior sample efficiency, faster convergence, and robust flight performance, showcasing the benefits of augmenting gradient-based policy learning with task-specific geometric priors.
Vector fields can guide differentiable policy learning to achieve agile drone racing, enabling faster convergence and better sim-to-real transfer.
Autonomous drone racing in complex environments requires agile, high-speed flight while maintaining reliable obstacle avoidance. Differentiable-physics-based policy learning has recently demonstrated high sample efficiency and remarkable performance across various tasks, including agile drone flight and quadruped locomotion. However, applying such methods to drone racing remains difficult, as key objective like gate traversal are inherently hard to express as smooth, differentiable losses. To address these challenges, we propose DiffRacing, a novel vector field-augmented differentiable policy learning framework. DiffRacing integrates differentiable losses and vector fields into the training process to provide continuous and stable gradient signals, balancing obstacle avoidance and high-speed gate traversal. In addition, a differentiable Delta Action Model compensates for dynamics mismatch, enabling efficient sim-to-real transfer without explicit system identification. Extensive simulation and real-world experiments demonstrate that DiffRacing achieves superior sample efficiency, faster convergence, and robust flight performance, thereby demonstrating that vector fields can augment traditional gradient-based policy learning with a task-specific geometric prior.