Search papers, labs, and topics across Lattice.
This paper introduces Intention-Guided Reinforcement Learning with Dirichlet Energy constraint (IGDE) to address partial observability and coordination challenges in heterogeneous multi-robot systems. IGDE uses an intention-guided module to derive long-horizon strategies from local observations and incorporates a Dirichlet energy constraint to diversify environmental cognition among robot classes. Experiments and real-world deployments demonstrate that IGDE enhances cooperative efficiency without requiring prior knowledge or explicit heterogeneity modeling.
Heterogeneous robot teams can now cooperate more efficiently in partially observable environments thanks to a new intention-guided RL approach that doesn't require prior knowledge of robot capabilities.
Multi-robot systems have demonstrated significant potential in accomplishing complex tasks, such as cooperative pursuit, search-and-rescue operations. The emergence of heterogeneous robots with diverse capabilities and characteristics shows superior adaptability compared with homogeneous teams. However, in practical applications, global information is typically inaccessible, and composite teams must contend with partial observability and coordination difficulties. To address the issue in heterogeneous multi-robot systems, we propose a novel Intention-Guided reinforcement learning approach with Dirichlet Energy constraint (IGDE). Specifically, an intention-guided module is designed to derive long-horizon strategies based solely on local observations, enabling foresighted decision-making. In addition, a Dirichlet energy constraint is incorporated into the communication process to enhance the diversity of environmental cognition among different classes of robots. Heterogeneous robots perform class-aware actions driven by distinct cognitive representations, thereby enhancing cooperative efficiency. Notably, our approach alleviates the need of prior knowledge and heterogeneity modeling. Extensive comparative experiments and ablation studies verify the effectiveness of the proposed framework. Additionally, real-world deployment is conducted to demonstrate the practicality.