This paper introduces a Hilbert-augmented reinforcement learning framework for decentralized multi-robot coverage and exploration, addressing the challenges of sparse rewards and scalability. The authors integrate Hilbert space-filling curves as spatial priors into DQN and PPO to structure exploration and reduce redundant coverage. Experiments show improved coverage efficiency, lower redundancy, and faster convergence than standard DQN/PPO baselines, and the approach is validated both in simulation and on a Boston Dynamics Spot robot.
Used as spatial priors, Hilbert space-filling curves give multi-robot RL faster convergence and more efficient coverage in sparse-reward environments.
We present a coverage framework that integrates Hilbert space-filling priors into decentralized multi-robot learning and execution. We augment DQN and PPO with Hilbert-based spatial indices to structure exploration and reduce redundancy in sparse-reward environments, and we evaluate scalability in multi-robot grid coverage. We further describe a waypoint interface that converts Hilbert orderings into curvature-bounded, time-parameterized SE(2) trajectories, i.e., planar poses (x, y, θ), so that the trajectories remain feasible to execute onboard resource-constrained robots. Experiments show improvements in coverage efficiency, redundancy, and convergence speed over DQN/PPO baselines. In addition, we validate the approach on a Boston Dynamics Spot legged robot, executing the generated trajectories in indoor environments and observing reliable coverage with low redundancy. These results indicate that geometric priors improve autonomy and scalability for swarm and legged robotics.
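To make the notion of a Hilbert-based spatial index concrete, the following is a minimal sketch of the standard Hilbert curve mapping between a grid cell and its position along the curve. It is not the authors' implementation. The function names (`xy2d`, `d2xy`, `hilbert_order`, `normalized_index`) are illustrative, the grid is assumed to be square with a power-of-two side length, and the idea of feeding a normalized index to the agent as an extra observation feature is an assumption about how such a prior could be attached to DQN or PPO.

```python
def _rot(n, x, y, rx, ry):
    """Rotate/flip a quadrant so the sub-curve has the right orientation."""
    if ry == 0:
        if rx == 1:
            x = n - 1 - x
            y = n - 1 - y
        x, y = y, x
    return x, y


def xy2d(n, x, y):
    """Map cell (x, y) on an n x n grid (n a power of two) to its Hilbert index."""
    d = 0
    s = n // 2
    while s > 0:
        rx = 1 if (x & s) else 0
        ry = 1 if (y & s) else 0
        d += s * s * ((3 * rx) ^ ry)
        x, y = _rot(n, x, y, rx, ry)
        s //= 2
    return d


def d2xy(n, d):
    """Inverse of xy2d: map a Hilbert index d to the cell (x, y)."""
    x = y = 0
    t = d
    s = 1
    while s < n:
        rx = 1 & (t // 2)
        ry = 1 & (t ^ rx)
        x, y = _rot(s, x, y, rx, ry)
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y


def hilbert_order(n):
    """All cells of the n x n grid in Hilbert visiting order."""
    return [d2xy(n, d) for d in range(n * n)]


def normalized_index(n, x, y):
    """Hypothetical observation feature: Hilbert index scaled to [0, 1)."""
    return xy2d(n, x, y) / (n * n)


if __name__ == "__main__":
    for cell in hilbert_order(4):
        print(cell, xy2d(4, *cell))
```

Consecutive indices always map to neighbouring cells, which is the locality property that makes the ordering useful as a coverage prior: a robot that follows increasing indices sweeps the grid without jumps and without revisiting cells.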