Search papers, labs, and topics across Lattice.
The paper introduces a library-based initialization scheme for reinforcement learning of multirotor control policies, enabling efficient knowledge transfer across different multirotor configurations. They propose a physics-aware neural control architecture that integrates a reinforcement learning controller with a supervised control allocation network, facilitating the reuse of pre-trained policies. A policy evaluation-based similarity measure is used to select appropriate policies from a library for initialization, demonstrating a significant reduction in environment interactions (up to 73.5%) compared to training from scratch.
Forget painstakingly retraining: this method slashes multirotor control policy learning time by 73% by intelligently transferring knowledge between different drone configurations.
Efficiently training control policies for robots is a major challenge that can greatly benefit from utilizing knowledge gained from training similar systems through cross-embodiment knowledge transfer. In this work, we focus on accelerating policy training using a library-based initialization scheme that enables effective knowledge transfer across multirotor configurations. By leveraging a physics-aware neural control architecture that combines a reinforcement learning-based controller and a supervised control allocation network, we enable the reuse of previously trained policies. To this end, we utilize a policy evaluation-based similarity measure that identifies suitable policies for initialization from a library. We demonstrate that this measure correlates with the reduction in environment interactions needed to reach target performance and is therefore suited for initialization. Extensive simulation and real-world experiments confirm that our control architecture achieves state-of-the-art control performance, and that our initialization scheme saves on average up to $73.5\%$ of environment interactions (compared to training a policy from scratch) across diverse quadrotor and hexarotor designs, paving the way for efficient cross-embodiment transfer in reinforcement learning.