This paper introduces an end-to-end visuomotor policy for multi-robot laser tag, trained via MARL and distillation, to address limitations of classic modular approaches. A permutation-invariant feature extractor and depth heatmap input are used to improve performance. The resulting policy achieves a 16.7% improvement in hitting accuracy and 6% improvement in collision avoidance compared to classic methods, and is successfully deployed on real robots.
Skip explicit depth mapping and inter-robot comms: end-to-end visuomotor policies, trained via MARL and distillation, can outperform classic modular approaches in multi-robot tasks like laser tag, here by 16.7% in hitting accuracy and 6% in collision avoidance.
In this paper, we study multi-robot laser tag, a simplified yet practical shooting-game-style task. Classic modular approaches to this task face challenges such as limited observability and reliance on depth mapping and inter-robot communication. To overcome these issues, we present an end-to-end visuomotor policy that maps images directly to robot actions. We train a high-performing teacher policy with multi-agent reinforcement learning and distill its knowledge into a vision-based student policy. Technical designs, including a permutation-invariant feature extractor and depth heatmap input, improve performance over standard architectures. Our policy outperforms classic methods by 16.7% in hitting accuracy and 6% in collision avoidance, and is successfully deployed on real robots. Code will be released publicly.
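
As an illustration of what "permutation-invariant" means for a feature extractor, here is a minimal sketch. It is not the paper's architecture, which the abstract does not specify. It assumes a Deep Sets-style design: each entity (for example, one feature vector per observed robot, itself an assumption) is embedded with shared weights, and the embeddings are mean-pooled so the output does not depend on input order. All names (`make_layer`, `dense_tanh`, `encode_set`) and the random weights are hypothetical.

```python
import math
import random

def make_layer(in_dim, out_dim, rng):
    """Random weights for one dense layer (hypothetical stand-in for trained weights)."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(in_dim)] for _ in range(out_dim)]

def dense_tanh(weights, x):
    """Apply one dense layer followed by tanh to a single feature vector."""
    return [math.tanh(sum(w * v for w, v in zip(row, x))) for row in weights]

def encode_set(weights, entities):
    """Embed every entity with the same weights, then mean-pool across entities."""
    embedded = [dense_tanh(weights, e) for e in entities]
    n = len(embedded)
    return [sum(column) / n for column in zip(*embedded)]

rng = random.Random(0)
weights = make_layer(in_dim=3, out_dim=4, rng=rng)

# Three illustrative per-robot feature vectors (made-up values).
robots = [[0.1, 0.2, 0.3], [1.0, -0.5, 0.0], [-0.7, 0.4, 0.9]]
shuffled = [robots[2], robots[0], robots[1]]

a = encode_set(weights, robots)
b = encode_set(weights, shuffled)

# The pooled features agree up to floating-point rounding, whatever the order.
same = all(math.isclose(x, y, abs_tol=1e-12) for x, y in zip(a, b))
print(same)
```

Because pooling discards ordering, a policy built on such an encoder does not need a fixed indexing of teammates or opponents. Max or sum pooling would give the same invariance.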