This paper introduces RepMT-SAC, a novel framework for multi-task reinforcement learning that enhances sample efficiency and generalization through spectral MDP decomposition. By structuring the value function into a task-agnostic core with minimal task-specific adjustments, RepMT-SAC enables strong zero-shot performance on in-distribution tasks and rapid few-shot adaptation to out-of-distribution tasks. Experimental results on quadcopter trajectory-following tasks show that RepMT-SAC outperforms existing baselines by up to 30%, highlighting its effectiveness in knowledge transfer across tasks.
RepMT-SAC outperforms baselines by up to 30% on quadcopter trajectory-following tasks by structuring the value function around a task-agnostic core that transfers knowledge across tasks.
Reinforcement learning has achieved remarkable success in learning complex control policies, yet its applicability remains limited due to sample inefficiency and poor generalization across tasks. In this work, we propose RepMT-SAC, a framework for multi-task RL that enables efficient knowledge sharing and robust transfer to new tasks. RepMT-SAC uses spectral MDP decomposition to capture transferable dynamics, structuring the value function into a task-agnostic core with a minimal task-specific adjustment. This design allows for strong zero-shot performance on in-distribution tasks and rapid few-shot adaptation to out-of-distribution tasks. We evaluate RepMT-SAC on quadcopter trajectory-following tasks across in-distribution and out-of-distribution contexts, demonstrating that it outperforms baselines by up to 30%.
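To make the idea of a task-agnostic core with a small task-specific adjustment concrete, the following is a minimal sketch and not the paper's implementation. It assumes the linear form common in spectral MDP representations, where Q_task(s, a) is approximated by phi(s, a) · w_task: the feature map phi is shared across tasks and only the weight vector w_task changes. The names `phi`, `q_value`, `adapt_task_weights`, the dimensions, and the random stand-in for learned features are all hypothetical. The ridge-regression step is a simple illustration of few-shot adaptation, not the SAC-based update the framework's name suggests.

```python
import numpy as np

rng = np.random.default_rng(0)

FEATURE_DIM = 8  # hypothetical size of the shared representation
INPUT_DIM = 5    # hypothetical size of a concatenated (state, action) vector

# Task-agnostic core: a fixed random map standing in for a learned phi(s, a).
W_core = rng.normal(size=(INPUT_DIM, FEATURE_DIM))


def phi(state_action):
    """Shared features phi(s, a); identical for every task."""
    return np.tanh(state_action @ W_core)


def q_value(state_action, task_weights):
    """Assumed form Q_task(s, a) = phi(s, a) . w_task; only w_task is task-specific."""
    return phi(state_action) @ task_weights


def adapt_task_weights(state_actions, q_targets, ridge=1e-8):
    """Few-shot adaptation: fit w_task by ridge regression with phi frozen."""
    features = phi(state_actions)
    gram = features.T @ features + ridge * np.eye(FEATURE_DIM)
    return np.linalg.solve(gram, features.T @ q_targets)


# Synthetic "new task": targets come from a hidden weight vector (made-up data).
true_w = rng.normal(size=FEATURE_DIM)
few_shot_inputs = rng.normal(size=(32, INPUT_DIM))
few_shot_targets = q_value(few_shot_inputs, true_w)

# Adapt using only the 32 samples, then compare on held-out inputs.
w_new = adapt_task_weights(few_shot_inputs, few_shot_targets)
held_out = rng.normal(size=(10, INPUT_DIM))
max_error = np.max(np.abs(q_value(held_out, w_new) - q_value(held_out, true_w)))
```

Under this assumed structure, adapting to a new task means estimating only `FEATURE_DIM` numbers rather than retraining a full value network, which is one way to read the abstract's claim of rapid few-shot adaptation.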