Feb 17, 2026arXiv:2602.15533

Efficient Knowledge Transfer for Jump-Starting Control Policy Learning of Multirotors through Physics-Aware Neural Architectures

Welf Rehberg, Mihir Kulkarni, Philipp Weiss, Kostas Alexis

AI Summary

The paper introduces a library-based initialization scheme for reinforcement learning of multirotor control policies, enabling efficient knowledge transfer across different multirotor configurations. They propose a physics-aware neural control architecture that integrates a reinforcement learning controller with a supervised control allocation network, facilitating the reuse of pre-trained policies. A policy evaluation-based similarity measure is used to select appropriate policies from a library for initialization, demonstrating a significant reduction in environment interactions (up to 73.5%) compared to training from scratch.

Key Contribution

Forget painstakingly retraining: this method slashes multirotor control policy learning time by 73% by intelligently transferring knowledge between different drone configurations.

Abstract

Efficiently training control policies for robots is a major challenge that can greatly benefit from utilizing knowledge gained from training similar systems through cross-embodiment knowledge transfer. In this work, we focus on accelerating policy training using a library-based initialization scheme that enables effective knowledge transfer across multirotor configurations. By leveraging a physics-aware neural control architecture that combines a reinforcement learning-based controller and a supervised control allocation network, we enable the reuse of previously trained policies. To this end, we utilize a policy evaluation-based similarity measure that identifies suitable policies for initialization from a library. We demonstrate that this measure correlates with the reduction in environment interactions needed to reach target performance and is therefore suited for initialization. Extensive simulation and real-world experiments confirm that our control architecture achieves state-of-the-art control performance, and that our initialization scheme saves on average up to $73.5\%$ of environment interactions (compared to training a policy from scratch) across diverse quadrotor and hexarotor designs, paving the way for efficient cross-embodiment transfer in reinforcement learning.

Architecture Design (Transformers, SSMs, MoE)Robotics & Embodied AI Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Efficient Knowledge Transfer for Jump-Starting Control Policy Learning of Multirotors through Physics-Aware Neural Architectures

Related Papers