Mar 9, 2026

arXiv:2603.08418

Meta-RL with Shared Representations Enables Fast Adaptation in Energy Systems

Théo Zangato, Aomar Osmani, Pegah Alizadeh

AI Summary

This paper introduces a Meta-RL framework for Building Energy Management Systems (BEMS) that combines bi-level optimization with a hybrid actor-critic architecture to improve sample efficiency and inter-task adaptability. A key component is a meta-learned state feature extractor shared by the actor and critic networks, which improves knowledge transfer and limits overfitting. The framework also shares parameters between the outer- and inner-loop actor networks to accelerate adaptation. On a real-world BEMS dataset, it performs better than conventional RL and Meta-RL methods.

Key Contribution

Meta-RL can learn shared representations that enable faster adaptation in energy systems, outperforming conventional RL and Meta-RL methods on real-world building management data.

Abstract

Meta-Reinforcement Learning addresses critical limitations of conventional Reinforcement Learning in multi-task and non-stationary environments by enabling fast policy adaptation and improved generalization. We introduce a novel Meta-RL framework that integrates a bi-level optimization scheme with a hybrid actor-critic architecture designed specifically to enhance sample efficiency and inter-task adaptability. To improve knowledge transfer, we meta-learn a shared state feature extractor that is jointly optimized across the actor and critic networks, providing efficient representation learning and limiting overfitting to individual tasks or dominant profiles. Additionally, we propose a parameter-sharing mechanism between the outer- and inner-loop actor networks to reduce redundant learning and accelerate adaptation when tasks are revisited. The approach is validated on a real-world Building Energy Management Systems dataset covering nearly a decade of temporal and structural variability, for which we propose a task preparation method to promote generalization. Experiments demonstrate effective task adaptation and better performance than conventional RL and Meta-RL methods.
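The shared feature extractor described in the abstract can be illustrated with a small structural sketch. This is not the authors' implementation: the class names, layer sizes, linear heads, and the `make_inner_learner` helper are all hypothetical, and the abstract does not specify how the outer- and inner-loop actors share parameters. The sketch assumes one plausible reading, in which the inner-loop learner starts from a copy of the outer-loop actor head while reusing the same extractor object.

```python
import copy
import math
import random


class SharedExtractor:
    """Hypothetical state feature extractor shared by actor and critic."""

    def __init__(self, state_dim, feat_dim, rng):
        self.w = [[rng.uniform(-0.5, 0.5) for _ in range(state_dim)]
                  for _ in range(feat_dim)]

    def __call__(self, state):
        # One tanh feature per row of the weight matrix.
        return [math.tanh(sum(wi * si for wi, si in zip(row, state)))
                for row in self.w]


class Head:
    """Linear head on top of the shared features (actor or critic)."""

    def __init__(self, feat_dim, out_dim, rng):
        self.w = [[rng.uniform(-0.5, 0.5) for _ in range(feat_dim)]
                  for _ in range(out_dim)]

    def __call__(self, feats):
        return [sum(wi * fi for wi, fi in zip(row, feats)) for row in self.w]


def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


class ActorCritic:
    """Actor and critic heads reading from a single shared extractor."""

    def __init__(self, state_dim, feat_dim, n_actions, seed=0):
        rng = random.Random(seed)
        self.extractor = SharedExtractor(state_dim, feat_dim, rng)
        self.actor = Head(feat_dim, n_actions, rng)
        self.critic = Head(feat_dim, 1, rng)

    def act_probs(self, state):
        return softmax(self.actor(self.extractor(state)))

    def value(self, state):
        return self.critic(self.extractor(state))[0]


def make_inner_learner(meta_model):
    """Assumed inner-loop setup: copy the actor head, share the rest."""
    inner = copy.copy(meta_model)                  # shallow copy: extractor and critic shared
    inner.actor = copy.deepcopy(meta_model.actor)  # task-specific actor copy to adapt
    return inner
```

Calling `make_inner_learner(model)` yields a learner whose actor head can be adapted to one task without changing the outer-loop actor, while any update to the shared extractor is seen by both the actor and the critic. Training logic (the actual inner and outer updates) is deliberately left out, since the abstract does not give the update rules.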

Architecture Design (Transformers, SSMs, MoE)

Robotics & Embodied AI

Training Efficiency & Optimization

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
