Search papers, labs, and topics across Lattice.
This paper introduces a reinforcement learning (RL) framework for enabling quadrotors to autonomously recover from falls and achieve stable hover from arbitrary ground attitudes using minimal onboard sensors. The approach addresses challenges such as partial observability and sensor invalidity by employing a recurrent policy within an asymmetric actor-critic architecture, coupled with an Incremental Nonlinear Dynamic Inversion (INDI) controller for effective tracking. Real-world experiments confirm the method's robustness, achieving successful recovery across various initial conditions and disturbances without the need for explicit state estimation.
Agile quadrotors can recover from falls and stabilize in hover using only lightweight sensors, even in the face of severe disturbances and sensor failures.
Autonomous fall recovery is a critical capability for quadrotors operating in real-world environments, where collisions or failures may leave the vehicle resting on the ground in an arbitrary attitude. This problem is challenging because recovery must be achieved under limited onboard sensing, in constrained free space, with ground contact, and in the presence of unknown disturbances. In this letter, we present an RL-based framework for autonomous fall recovery of a quadrotor from arbitrary ground attitudes to stable hover using only lightweight onboard sensors. To address severe partial observability and intermittent sensor invalidity, we train a recurrent policy within an asymmetric actor--critic architecture, leveraging an Incremental Nonlinear Dynamic Inversion (INDI) controller to track the policy output. Combined with high-fidelity simulations of motor response and optical flow, the overall training framework significantly reduces the sim-to-real gap. Simulation ablation studies validate the importance of the main design choices, while real-world experiments demonstrate zero-shot transfer and robust recovery under different initial attitudes, wind disturbances, and additional payloads. These results demonstrate that agile quadrotor fall recovery can be achieved without explicit state estimation using only limited and unreliable onboard sensing.