Yaocheng Zhang

Chinese Academy of Sciences ♣

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Tool Use & Agents (3)World Models & Planning (2)Scalable Oversight & Alignment Theory (1)Data Curation & Synthetic Data (1)

Frequent co-authors

Dongbin Zhao (3)Songjun Tu (2)Chengdong Xu (2)Linjing Li (2)

Papers (3)

Jun 28, 2026

2w ago

UCOB: Learning to Utilize and Evolve Agentic Skills via Credit-Aware On-Policy Bidirectional Self-Distillation

UCOB achieves unprecedented performance in agentic reinforcement learning by dynamically refining skill usage through credit-aware self-distillation.

Songjun Tu, Chengdong Xu, Qichao Zhang +6

Scalable Oversight & Alignment Theory Tool Use & Agents

Apr 15, 2026

$\pi$-Play: Multi-Agent Self-Play via Privileged Self-Distillation without External Data

Self-play can be dramatically improved by exploiting the "question construction path" it generates as privileged information for self-distillation, leading to 2-3x faster learning.

Yaocheng Zhang, Yuanheng Zhu, Yuanheng Zhu +8

Data Curation & Synthetic Data Tool Use & Agents Training Efficiency & Optimization+1

Mar 30, 2026

Dynamic Dual-Granularity Skill Bank for Agentic RL

Agentic RL agents can learn faster and perform better by dynamically maintaining a skill bank that combines high-level task guidance with low-level step-by-step decision support.

Chengdong Xu, Yaocheng Zhang, Xiangyuan Lan +2

Robotics & Embodied AI Tool Use & Agents World Models & Planning

Search

Yaocheng Zhang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)