This paper frames drone interception as a competitive reinforcement learning problem, training interceptor and target drone policies using Proximal Policy Optimization (PPO). A high-fidelity simulation environment built with JAX enables efficient GPU-accelerated training of low-level control policies. The trained policies outperform heuristic baselines in catch rate, time to catch, and crash rate, and are validated in a scaled real-world indoor flight arena.
Competitive reinforcement learning enables agile drone interception with higher catch rates and lower crash rates than heuristic baselines, and the trained policies are demonstrated in a scaled real-world indoor flight arena.
This article presents a method for intercepting an agile drone with another agile drone that carries a catching net. We formulate interception as a competitive reinforcement learning problem in which the interceptor and the target drone are controlled by separate policies trained with Proximal Policy Optimization (PPO). We introduce a high-fidelity simulation environment that integrates a realistic quadrotor dynamics model and a low-level control architecture, implemented in JAX for fast parallelized execution on GPUs. We train the agents on low-level control commands (collective thrust and body rates) to achieve agile flight for both the interceptor and the target. We compare the trained policies against common heuristic baselines in terms of catch rate, time to catch, and crash rate, and show that our solution outperforms these baselines when intercepting agile targets. Finally, we demonstrate the performance of the trained policies in a scaled real-world scenario using agile drones inside an indoor flight arena.
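To illustrate what a parallelized simulation step driven by collective thrust and body rates might look like in JAX, the following is a minimal sketch and not the paper's simulator. It assumes a drastically simplified model: mass-normalized thrust along the body z-axis, body rates tracked perfectly (no low-level controller, motor, or drag model), and explicit Euler integration. All names (`step`, `batched_step`, `quat_rotate`, `quat_multiply`) and the state layout are hypothetical.

```python
import jax
import jax.numpy as jnp

GRAVITY = jnp.array([0.0, 0.0, -9.81])  # world-frame gravity, m/s^2


def quat_multiply(a, b):
    """Hamilton product of two quaternions stored as (w, x, y, z)."""
    aw, av = a[0], a[1:]
    bw, bv = b[0], b[1:]
    w = aw * bw - jnp.dot(av, bv)
    v = aw * bv + bw * av + jnp.cross(av, bv)
    return jnp.concatenate([w[None], v])


def quat_rotate(q, v):
    """Rotate vector v from body frame to world frame by unit quaternion q."""
    w, u = q[0], q[1:]
    return v + 2.0 * jnp.cross(u, jnp.cross(u, v) + w * v)


def step(state, action, dt):
    """One Euler step for a single drone (simplifying assumptions above).

    state  = (position[3], velocity[3], quaternion[4])
    action = (mass-normalized collective thrust, body rates x, y, z)
    """
    pos, vel, quat = state
    thrust, body_rates = action[0], action[1:]

    # Thrust acts along the body z-axis; gravity acts in the world frame.
    thrust_body = jnp.stack([0.0 * thrust, 0.0 * thrust, thrust])
    acc = quat_rotate(quat, thrust_body) + GRAVITY
    vel = vel + acc * dt
    pos = pos + vel * dt

    # Quaternion kinematics: q_dot = 0.5 * q * (0, omega), then renormalize.
    omega_quat = jnp.concatenate([jnp.zeros(1), body_rates])
    quat = quat + 0.5 * quat_multiply(quat, omega_quat) * dt
    quat = quat / jnp.linalg.norm(quat)
    return pos, vel, quat


# vmap maps the single-drone step over a leading batch axis of states and
# actions; jit compiles the batched step for whichever backend is available.
batched_step = jax.jit(jax.vmap(step, in_axes=(0, 0, None)))

# Usage: 1024 drones starting at rest, all commanding hover thrust.
n = 1024
state = (
    jnp.zeros((n, 3)),
    jnp.zeros((n, 3)),
    jnp.tile(jnp.array([1.0, 0.0, 0.0, 0.0]), (n, 1)),
)
hover = jnp.tile(jnp.array([9.81, 0.0, 0.0, 0.0]), (n, 1))
next_pos, next_vel, next_quat = batched_step(state, hover, 0.01)
```

In a competitive setup, the interceptor and the target would each be advanced by a step of this kind, with actions supplied by their respective policies. Writing the step for one drone and batching it with `jax.vmap` is one common way to run many environments in parallel on a GPU.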