This paper introduces GraSP, an executable skill graph architecture that improves skill orchestration for LLM agents by compiling flat skill sets into typed directed acyclic graphs (DAGs) with explicit causal dependencies. The study finds that effective orchestration, rather than a larger number of skills, is what drives agent performance: GraSP improves reward by up to 19 points over the strongest baseline and cuts environment steps by up to 41% across four benchmarks. Its advantage grows with task complexity and holds up under skill over-retrieval, which underscores the importance of structured skill execution for LLM agents.
Structured orchestration of skills, not sheer quantity, is what drives LLM agent performance: GraSP improves reward by up to 19 points while cutting environment steps by up to 41%.
Skill ecosystems for LLM agents have matured rapidly, yet recent benchmarks show that providing agents with more skills does not monotonically improve performance -- focused sets of 2-3 skills outperform comprehensive documentation, and excessive skills actually hurt. The bottleneck has shifted from skill availability to skill orchestration: agents need not more skills, but a structural mechanism to select, compose, and execute them with explicit causal dependencies. We propose GraSP, the first executable skill graph architecture that introduces a compilation layer between skill retrieval and execution. GraSP transforms flat skill sets into typed directed acyclic graphs (DAGs) with precondition-effect edges, executes them with node-level verification, and performs locality-bounded repair through five typed operators -- reducing replanning from O(N) to O(d^h). Across ALFWorld, ScienceWorld, WebShop, and InterCode with eight LLM backbones, GraSP outperforms ReAct, Reflexion, ExpeL, and flat skill baselines in every configuration, improving reward by up to +19 points over the strongest baseline while cutting environment steps by up to 41%. GraSP's advantage grows with task complexity and is robust to both skill over-retrieval and quality degradation, confirming that structured orchestration -- not larger skill libraries -- is the key to reliable agent execution.
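To make the compile-then-execute idea concrete, here is a minimal Python sketch of a skill DAG with precondition-effect edges and node-level precondition checks. It is an illustration under stated assumptions, not GraSP's implementation: the `Skill` class, the functions `compile_graph`, `execute`, and `repair_region`, and the mug-heating task are all hypothetical, and the five typed repair operators are not modeled. `repair_region` only illustrates what "locality-bounded" could mean, assuming that a repair inspects nodes within a fixed number of hops of the failure rather than the whole plan.

```python
from dataclasses import dataclass
from graphlib import TopologicalSorter


@dataclass(frozen=True)
class Skill:
    # Hypothetical skill record: facts it needs and facts it produces.
    name: str
    preconditions: frozenset
    effects: frozenset


def compile_graph(skills):
    """Map each skill name to the set of skills whose effects it depends on."""
    graph = {s.name: set() for s in skills}
    for consumer in skills:
        for producer in skills:
            if producer.name != consumer.name and producer.effects & consumer.preconditions:
                graph[consumer.name].add(producer.name)
    return graph


def execute(skills, initial_facts=()):
    """Run skills in dependency order; stop at the first unmet precondition.

    Returns (executed names, failed name or None, missing facts).
    """
    by_name = {s.name: s for s in skills}
    facts = set(initial_facts)
    trace = []
    # static_order() yields predecessors first and raises CycleError on a cycle.
    for name in TopologicalSorter(compile_graph(skills)).static_order():
        skill = by_name[name]
        missing = skill.preconditions - facts
        if missing:  # node-level verification failed
            return trace, name, missing
        facts |= skill.effects  # assumption: a skill that runs always succeeds
        trace.append(name)
    return trace, None, set()


def repair_region(graph, failed, hops):
    """Nodes within `hops` predecessor edges of the failed node (assumed locality rule)."""
    region, frontier = {failed}, {failed}
    for _ in range(hops):
        frontier = {p for n in frontier for p in graph[n]} - region
        region |= frontier
    return region


# Hypothetical household task, loosely in the style of ALFWorld.
SKILLS = [
    Skill("find_mug", frozenset(), frozenset({"mug_located"})),
    Skill("pick_mug", frozenset({"mug_located"}), frozenset({"holding_mug"})),
    Skill("open_microwave", frozenset(), frozenset({"microwave_open"})),
    Skill("heat_mug", frozenset({"holding_mug", "microwave_open"}), frozenset({"mug_hot"})),
]

if __name__ == "__main__":
    print(execute(SKILLS))
    # Drop one skill to see a failure localized to a single node.
    broken = [s for s in SKILLS if s.name != "open_microwave"]
    print(execute(broken))
    print(repair_region(compile_graph(SKILLS), "heat_mug", hops=1))
```

In this sketch, removing `open_microwave` makes `heat_mug` fail its precondition check after the two upstream skills have already run, and a one-hop repair region covers only `heat_mug` and its direct predecessors instead of the full skill set.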