Search papers, labs, and topics across Lattice.
This paper introduces Skill-MAS, a novel approach for enhancing the performance of automatic Multi-Agent Systems (MAS) by evolving a high-level orchestration capability termed Meta-Skill, which decouples experience retention from parametric updates. By employing a closed optimization loop that includes Multi-Trajectory Rollout and Selective Reflection, Skill-MAS effectively distills systemic experience into generalizable strategies while leveraging the strengths of large language models. Experimental results across four complex benchmarks demonstrate significant performance improvements and robust transferability of the evolved Meta-Skills across various tasks and LLMs.
Evolving Meta-Skills enables automatic Multi-Agent Systems to achieve superior performance without sacrificing experience retention or scalability.
Large Language Model (LLM)-based automatic Multi-Agent Systems (MAS) generation has become a crucial frontier for tackling complex tasks. However, existing methods face a dilemma between model capability and experience retention. Inference-time MAS leverages frozen frontier LLMs but repeats identical searches without learning from past experience. Conversely, Training-time MAS internalizes experience via gradient updates but is constrained by the low capability ceiling of smaller models, and is hard to scale to large frontier LLMs. To bridge this gap, we propose Skill-MAS, a novel third path that decouples experience retention from parametric updates by conceptualizing the high-level orchestration capability as an evolvable Meta-Skill. Skill-MAS refines this architectural knowledge through a closed optimization loop: (1) Multi-Trajectory Rollout samples a behavioral distribution for each task under the current Meta-Skill; and (2) Selective Reflection adaptively selects priority tasks and applies hierarchical contrastive analysis to distill systemic experience into generalizable, strategy-level principles. Extensive experiments across four complex benchmarks and four distinct LLMs demonstrate that Skill-MAS not only achieves remarkable performance gains but also maintains a favorable cost-performance trade-off. Further analysis reveals that the evolved Meta-Skills are highly robust and exhibit strong transferability across unseen tasks and different LLMs.