Mar 12, 2026 · arXiv:2603.11980

Learning Visuomotor Policy for Multi-Robot Laser Tag Game

AI Summary

This paper introduces an end-to-end visuomotor policy for multi-robot laser tag, trained via MARL and distillation, to address limitations of classic modular approaches. A permutation-invariant feature extractor and depth heatmap input are used to improve performance. The resulting policy achieves a 16.7% improvement in hitting accuracy and 6% improvement in collision avoidance compared to classic methods, and is successfully deployed on real robots.

Key Contribution

End-to-end visuomotor policies, trained via MARL and distillation, can outperform classic modular approaches in multi-robot tasks like laser tag (by 16.7% in hitting accuracy and 6% in collision avoidance) without relying on depth mapping or inter-robot communication.

Abstract

In this paper, we study multi-robot laser tag, a simplified yet practical shooting-game-style task. Classic modular approaches to this task face challenges such as limited observability and reliance on depth mapping and inter-robot communication. To overcome these issues, we present an end-to-end visuomotor policy that maps images directly to robot actions. We train a high-performing teacher policy with multi-agent reinforcement learning and distill its knowledge into a vision-based student policy. Technical designs, including a permutation-invariant feature extractor and depth heatmap input, improve performance over standard architectures. Our policy outperforms classic methods by 16.7% in hitting accuracy and 6% in collision avoidance, and is successfully deployed on real robots. Code will be released publicly.
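
The abstract names a permutation-invariant feature extractor but does not describe its architecture. As a rough illustration of the general idea only, the sketch below applies one shared encoder to each observed entity and max-pools the results, so the output does not depend on the order in which entities are listed. The function names, dimensions, random weights, and the choice of max pooling are all assumptions made for this example, not the authors' design.

```python
import random


def make_shared_encoder(in_dim, out_dim, seed=0):
    """Build a toy shared encoder: one random linear layer followed by ReLU.

    Hypothetical stand-in for a learned per-entity network; the weights
    are random and untrained.
    """
    rng = random.Random(seed)
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(in_dim)]
               for _ in range(out_dim)]

    def encode(entity):
        # Same weights for every entity, so no entity slot is special.
        return [max(0.0, sum(w * x for w, x in zip(row, entity)))
                for row in weights]

    return encode


def permutation_invariant_features(entities, encode, out_dim):
    """Encode each entity, then take an element-wise max over entities.

    Max is symmetric in its arguments, so reordering `entities`
    cannot change the pooled feature vector.
    """
    if not entities:
        # No entity observed: fall back to a zero feature vector.
        return [0.0] * out_dim
    embedded = [encode(e) for e in entities]
    return [max(column) for column in zip(*embedded)]


if __name__ == "__main__":
    encode = make_shared_encoder(in_dim=3, out_dim=4)
    # Hypothetical per-robot observations (for example, relative x, y, heading).
    robots = [[0.1, 0.2, 0.3], [1.0, -0.5, 0.2], [-0.3, 0.8, 0.9]]
    features = permutation_invariant_features(robots, encode, 4)
    shuffled = permutation_invariant_features(robots[::-1], encode, 4)
    print(features == shuffled)
```

Pooling with a symmetric operation also lets the same extractor handle a varying number of visible robots, which is one common reason to prefer this pattern over concatenating per-robot features in a fixed order.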

Computer Vision, Multimodal Models, Robotics & Embodied AI

Citation Metrics

Citations: 0

Influential citations: 0

References: 32

Year: 2026

Venue: N/A
