Mar 3, 2026arXiv:2603.02783

Generative adversarial imitation learning for robot swarms: Learning from human demonstrations and trained policies

AI Summary

This paper introduces a generative adversarial imitation learning (GAIL) framework for training robot swarms from human demonstrations. The framework learns collective behaviors by discriminating between human demonstrations and the swarm's actions, and is evaluated on six missions using both manual and PPO-derived demonstrations. Results demonstrate the framework's ability to learn meaningful behaviors that perform comparably to the demonstrations, and these learned policies transfer well to real-world TurtleBot 4 swarms.

Key Contribution

Robot swarms can now learn complex collective behaviors directly from human demonstrations using a GAIL framework, achieving performance on par with expert policies and successfully transferring to real-world robots.

Abstract

In imitation learning, robots are supposed to learn from demonstrations of the desired behavior. Most of the work in imitation learning for swarm robotics provides the demonstrations as rollouts of an existing policy. In this work, we provide a framework based on generative adversarial imitation learning that aims to learn collective behaviors from human demonstrations. Our framework is evaluated across six different missions, learning both from manual demonstrations and demonstrations derived from a PPO-trained policy. Results show that the imitation learning process is able to learn qualitatively meaningful behaviors that perform similarly well as the provided demonstrations. Additionally, we deploy the learned policies on a swarm of TurtleBot 4 robots in real-robot experiments. The exhibited behaviors preserved their visually recognizable character and their performance is comparable to the one achieved in simulation.

RLHF & Preference Learning Robotics & Embodied AI

Citation Metrics

Citations0

Influential citations0

References29

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Generative adversarial imitation learning for robot swarms: Learning from human demonstrations and trained policies

Related Papers