Tsinghua AI, CAS, ECNU, Hainan University, MiniCPM-o Team, SEU, Zhongguancun Academy

Apr 30, 2026

arXiv:2604.27488

Skills-Coach: A Self-Evolving Skill Optimizer via Training-Free GRPO

Yu Tian, Jiawei Chen, Lifan Zheng, Mingxiang Tao, Xinyi Zeng, Zhaoxia Yin, Hang Su, Xian Sun

AI Summary

Skills-Coach is a training-free framework that applies a variant of Group Relative Policy Optimization (GRPO) to the self-evolution of skills in LLM-based agents. It combines four modules: diverse task generation, lightweight prompt and code optimization, comparative execution of original and optimized skills, and traceable evaluation. Experiments on Skill-X, a benchmark of 48 diverse skills, show significant performance gains across skill categories.
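To make the "group relative" idea concrete: in standard GRPO (Group Relative Policy Optimization), each candidate's reward is compared with the mean and standard deviation of the other candidates sampled for the same task. The summary does not say how Skills-Coach consumes this signal without training, so the following is only a minimal sketch of the comparison step. The function name, the example scores, and the use of the population standard deviation are assumptions for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Score each reward relative to its own group's mean and spread."""
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population std; an assumption, not from the paper
    # eps keeps the division defined when every candidate scores the same
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical evaluation scores for four candidate rewrites of one skill,
# all run on the same task.
scores = [0.2, 0.5, 0.5, 0.8]
advantages = group_relative_advantages(scores)

# The candidate with the largest advantage is the one to keep or build on.
best = max(range(len(scores)), key=lambda i: advantages[i])
```

Candidates that score above their group's average receive a positive advantage and those below receive a negative one, so the signal depends only on within-group comparison rather than on an absolute reward scale.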

Key Contribution

Skills-Coach offers a training-free approach to improving the performance of LLM-based agent skills; its results on Skill-X indicate that substantial gains are possible without additional model training.

Abstract

We introduce Skills-Coach, a novel automated framework designed to significantly enhance the self-evolution of skills within Large Language Model (LLM)-based agents. Addressing the current fragmentation of the skill ecosystem, Skills-Coach explores the boundaries of skill capabilities, thereby facilitating the comprehensive competency coverage essential for intelligent applications. The framework comprises four core modules: a Diverse Task Generation Module that systematically creates a comprehensive test suite for various skills; a Lightweight Optimization Module dedicated to optimizing skill prompts and their corresponding code; a Comparative Execution Module facilitating the execution and evaluation of both original and optimized skills; and a Traceable Evaluation Module, which rigorously evaluates performance against specified criteria. Skills-Coach offers flexible execution options through its virtual and real modes. To validate its efficacy, we introduce Skill-X, a comprehensive benchmark dataset consisting of 48 diverse skills. Experimental results demonstrate that Skills-Coach achieves significant performance improvements in skill capability across a wide range of categories, highlighting its potential to advance the development of more robust and adaptable LLM-based agents.
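The four modules described in the abstract can be read as one optimization loop: generate tasks, propose a revised skill, run both versions, and score them with a record of why. The sketch below illustrates that flow under stated assumptions. Every name in it (`Skill`, `generate_tasks`, `optimize`, `execute`, `evaluate`, `coach`) is hypothetical, the module bodies are toy stubs rather than LLM calls, and the substring-based scoring is a placeholder for whatever criteria the paper actually uses.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass
class Skill:
    name: str
    prompt: str


def generate_tasks(skill: Skill, n: int) -> List[str]:
    """Diverse Task Generation (stub): build a small test suite for the skill."""
    return [f"{skill.name} task {i}" for i in range(n)]


def optimize(skill: Skill) -> Skill:
    """Lightweight Optimization (stub): return a revised copy of the skill."""
    return Skill(skill.name, skill.prompt + " Be explicit about edge cases.")


def execute(skill: Skill, task: str) -> str:
    """Comparative Execution (stub): run one skill on one task."""
    return f"[{skill.prompt}] -> {task}"


def evaluate(output: str, criteria: List[str]) -> Dict:
    """Traceable Evaluation (stub): score and record which criteria matched."""
    matched = [c for c in criteria if c in output]
    return {"score": len(matched) / len(criteria), "matched": matched}


def coach(skill: Skill, criteria: List[str], n_tasks: int = 3) -> Tuple[Skill, Dict]:
    """Run original and optimized skills on the same tasks; keep the better one."""
    candidate = optimize(skill)
    tasks = generate_tasks(skill, n_tasks)
    base = [evaluate(execute(skill, t), criteria) for t in tasks]
    new = [evaluate(execute(candidate, t), criteria) for t in tasks]
    base_avg = sum(r["score"] for r in base) / len(tasks)
    new_avg = sum(r["score"] for r in new) / len(tasks)
    chosen = candidate if new_avg > base_avg else skill
    # The trace keeps per-task results so the decision can be audited later.
    return chosen, {"original": base, "optimized": new}


skill = Skill("csv", "Parse CSV.")
chosen, trace = coach(skill, criteria=["edge cases", "task"])
```

Returning the per-task trace alongside the chosen skill mirrors the "traceable" aspect of the evaluation module: the caller can see which criteria each version satisfied rather than only the final score. The abstract's virtual and real execution modes are not modeled here.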

Eval Frameworks & Benchmarks, Tool Use & Agents

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
