This paper introduces the fluid-agent environment, a multi-agent reinforcement learning (MARL) framework in which agents can create new agents at run time. The work extends game-theoretic solution concepts to games whose population can change, and evaluates existing MARL algorithms on fluid versions of Predator-Prey and Level-Based Foraging, as well as on a newly introduced environment. The results show that fluid-agent teams adjust their size to match environmental demands and reach solution strategies not observed in fixed-population MARL.
Forget fixed teams: this reinforcement learning framework lets agents spawn new teammates on the fly, opening up strategies not observed when the population is fixed.
The primary focus of multi-agent reinforcement learning (MARL) has been to study interactions among a fixed number of agents embedded in an environment. However, in the real world, the number of agents is neither fixed nor known a priori. Moreover, an agent can decide to create other agents (for example, a cell may divide, or a company may spin off a division). In this paper, we propose a framework that allows agents to create other agents; we call this a fluid-agent environment. We present game-theoretic solution concepts for fluid-agent games and empirically evaluate the performance of several MARL algorithms within this framework. Our experiments include fluid variants of established benchmarks such as Predator-Prey and Level-Based Foraging, where agents can dynamically spawn, as well as a new environment we introduce that highlights how fluidity can unlock novel solution strategies beyond those observed in fixed-population settings. We demonstrate that this framework yields agent teams that adjust their size dynamically to match environmental demands.
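To make the idea of agent creation concrete, the following is a minimal sketch of a fluid-agent environment, not the formalism or code used in this paper. Everything in it is an illustrative assumption: the class name `ToyFluidEnv`, the one-dimensional world, the `spawn` action, the `spawn_cost` penalty, and the `max_agents` cap are all hypothetical. It shows only the core mechanic: the set of agents returned by `step` can grow when an agent chooses to spawn.

```python
from dataclasses import dataclass, field
from typing import Dict, Tuple

# Movement actions and their effect on a 1-D position (illustrative only).
MOVES = {"left": -1, "stay": 0, "right": 1}


@dataclass
class ToyFluidEnv:
    """Hypothetical 1-D world in which any agent may spawn a new agent."""

    size: int = 10           # number of cells in the world (assumption)
    max_agents: int = 4      # population cap (assumption, not from the paper)
    spawn_cost: float = 1.0  # penalty paid by the parent (assumption)
    positions: Dict[str, int] = field(default_factory=dict)
    _next_id: int = 0

    def reset(self) -> Dict[str, int]:
        """Start each episode with a single agent at cell 0."""
        self.positions = {"agent_0": 0}
        self._next_id = 1
        return dict(self.positions)

    def step(
        self, actions: Dict[str, str]
    ) -> Tuple[Dict[str, int], Dict[str, float]]:
        """Apply one action per living agent; return positions and rewards."""
        # Only agents alive at the start of the step act and receive rewards.
        rewards = {name: 0.0 for name in self.positions}
        for name, action in actions.items():
            if name not in rewards:
                continue
            if action == "spawn":
                # Spawning is a no-op once the population cap is reached.
                if len(self.positions) < self.max_agents:
                    child = f"agent_{self._next_id}"
                    self._next_id += 1
                    # The child starts in the parent's cell.
                    self.positions[child] = self.positions[name]
                    rewards[name] -= self.spawn_cost
            else:
                new_pos = self.positions[name] + MOVES[action]
                # Clamp the position to the world boundaries.
                self.positions[name] = min(max(new_pos, 0), self.size - 1)
        return dict(self.positions), rewards
```

Under these assumptions, calling `env.step({"agent_0": "spawn"})` after `env.reset()` returns an observation dictionary with two agents, so a learning algorithm running on top of it must handle a joint action space whose size changes between steps. The spawn cost is one simple way to make team size a trade-off rather than a free choice.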