The 39th Research Institute of ChinaXJTUJul 31, 2025arXiv:2507.23172

Benchmarking Massively Parallelized Multi-Task Reinforcement Learning for Robotics Tasks

Vira Joshi, Zifan Xu, Bo Liu, Peter Stone, Amy Zhang

AI Summary

The authors introduce MTBench, a massively parallelized multi-task reinforcement learning benchmark for robotics, consisting of 50 manipulation and 20 locomotion tasks implemented in IsaacGym. They evaluate the performance of four base RL algorithms combined with seven state-of-the-art MTRL algorithms within this framework. Their experiments demonstrate the efficiency of MTBench for evaluating MTRL approaches and identify challenges specific to combining massive parallelism with MTRL.

Key Contribution

Massively parallelizing multi-task RL reveals unexpected challenges, suggesting that simply scaling up existing algorithms may not be sufficient for optimal performance in complex robotics scenarios.

Abstract

Multi-task Reinforcement Learning (MTRL) has emerged as a critical training paradigm for applying reinforcement learning (RL) to a set of complex real-world robotic tasks, which demands a generalizable and robust policy. At the same time, \emph{massively parallelized training} has gained popularity, not only for significantly accelerating data collection through GPU-accelerated simulation but also for enabling diverse data collection across multiple tasks by simulating heterogeneous scenes in parallel. However, existing MTRL research has largely been limited to off-policy methods like SAC in the low-parallelization regime. MTRL could capitalize on the higher asymptotic performance of on-policy algorithms, whose batches require data from the current policy, and as a result, take advantage of massive parallelization offered by GPU-accelerated simulation. To bridge this gap, we introduce a massively parallelized $\textbf{M}$ulti-$\textbf{T}$ask $\textbf{Bench}$mark for robotics (MTBench), an open-sourced benchmark featuring a broad distribution of 50 manipulation tasks and 20 locomotion tasks, implemented using the GPU-accelerated simulator IsaacGym. MTBench also includes four base RL algorithms combined with seven state-of-the-art MTRL algorithms and architectures, providing a unified framework for evaluating their performance. Our extensive experiments highlight the superior speed of evaluating MTRL approaches using MTBench, while also uncovering unique challenges that arise from combining massive parallelism with MTRL. Code is available at https://github.com/Viraj-Joshi/MTBench

Distributed Systems & Hardware Robotics & Embodied AI Training Efficiency & Optimization

Citation Metrics

Citations6

Influential citations0

References71

Year2025

VenuearXiv.org

Related Papers

Finding related papers...

Search

Benchmarking Massively Parallelized Multi-Task Reinforcement Learning for Robotics Tasks

Related Papers