May 4, 2026arXiv:2605.02396

HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness

Jianing Wang, Linsen Guo, Zhengyu Chen, Qisong Guo, Hongyu Zang, Wenjie Shi, Haoxiang Ma, Xiangyu Xi, Xiaoyu Li, Xunliang Cai

AI Summary

The paper introduces "HeavySkill," a perspective that reframes complex reasoning in agentic frameworks as an inner skill within LLM parameters, rather than solely relying on external orchestration. This inner skill is modeled as a two-stage pipeline: parallel reasoning followed by summarization. Empirical results demonstrate that HeavySkill outperforms Best-of-N baselines and can be further scaled via reinforcement learning, suggesting a path to self-improving LLMs.

Key Contribution

Forget brittle orchestration layers – LLMs can internalize complex reasoning as a learnable "HeavySkill" that rivals external agentic frameworks.

Abstract

Recent advances in agentic harness with orchestration frameworks that coordinate multiple agents with memory, skills, and tool use have achieved remarkable success in complex reasoning tasks. However, the underlying mechanism that truly drives performance remains obscured behind intricate system designs. In this paper, we propose HeavySkill, a perspective that views heavy thinking not only as a minimal execution unit in orchestration harness but also as an inner skill internalized within the model's parameters that drives the orchestrator to solve complex tasks. We identify this skill as a two-stage pipeline, i.e., parallel reasoning then summarization, which can operate beneath any agentic harness. We present a systematic empirical study of HeavySkill across diverse domains. Our results show that this inner skill consistently outperforms traditional Best-of-N (BoN) strategies; notably, stronger LLMs can even approach Pass@N performance. Crucially, we demonstrate that the depth and width of heavy thinking, as a learnable skill, can be further scaled via reinforcement learning, offering a promising path toward self-evolving LLMs that internalize complex reasoning without relying on brittle orchestration layers.

Reasoning & Chain-of-Thought Tool Use & Agents

Citation Metrics

Citations0

Influential citations0

References37

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

HeavySkill: Heavy Thinking as the Inner Skill in Agentic Harness

Related Papers