CUHKFudanHUSTShanghai AI LabShanghai InnovationSJTUJun 8, 2026arXiv:2606.09365

Experience Makes Skillful: Enabling Generalizable Medical Agent Reasoning via Self-Evolving Skill Memory

Haoran Sun, Wenjie Li, Yujie Zhang, Zekai Lin, Fanrui Zhang, Kaitao Chen, Xingqi He, Yichen Li, Mianxin Liu, Lei Liu, Yankai Jiang

AI Summary

This paper introduces SkeMex, a self-evolving framework designed to enhance medical agents' reasoning capabilities by utilizing a skill-based memory system that operates independently of model weight updates. By distilling informative interaction trajectories into structured skills and employing a context-dependent utility estimation for memory governance, SkeMex effectively organizes and retrieves relevant procedural knowledge. Experimental results demonstrate that SkeMex significantly outperforms existing memory-based agents in various clinical tasks, showcasing its ability to generalize across different model architectures and support transferable skills.

Key Contribution

SkeMex enables medical agents to evolve their reasoning capabilities by transforming raw experience into structured, reusable skills, outperforming traditional memory systems.

Abstract

Medical agent systems are increasingly expected to support interactive clinical decision making rather than only static question answering. In such settings, effective agents must reuse prior experience across evolving cases, yet existing memory mechanisms often retain raw historical traces that are redundant, noisy, and difficult to govern. More importantly, they rarely distinguish which memories are truly useful for future reasoning. This limits their ability to accumulate compact and reliable experience for long-horizon clinical reasoning. To close this gap, we propose SkeMex, a post-deployment self-evolution framework that improves medical agents through a skill-based memory without updating model weights. SkeMex distills informative interaction trajectories into structured skills that encode reusable procedural knowledge, and organizes them into a multi-branch repository spanning general, task-specific, and action-level experience. To determine which memories should be reused and retained, SkeMex estimates context-dependent utility from environment feedback and uses it to guide value-aware retrieval and repository governance. A closed-loop ``Read--Write--Assess--Govern" lifecycle further supports continual evolution by writing new skills, updating utilities, promoting useful memories, and removing harmful entries. Experiments across diverse clinical tasks show that SkeMex consistently outperforms representative memory-based agents in both offline and online settings. It also generalizes across model backbones and supports transferable skill memory. All data and code will be released publicly.

Reasoning & Chain-of-Thought Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Experience Makes Skillful: Enabling Generalizable Medical Agent Reasoning via Self-Evolving Skill Memory

Related Papers