Zihao Cheng

Beihang University

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Reasoning & Chain-of-Thought (2)Tool Use & Agents (2)RLHF & Preference Learning (1)Eval Frameworks & Benchmarks (1)

Frequent co-authors

Yingyu Shan (2)Yuhang Guo (2)Zeming Liu (2)Xiangrong Zhu (2)

Papers (3)

Jun 17, 2026

1d ago·also Beihang, Independent Researcher

Learning from Own Solutions: Self-Conditioned Credit Assignment for Reinforcement Learning with Verifiable Rewards

Self-conditioning on verified trajectories boosts reinforcement learning performance by over 8%, revealing the power of internal feedback in credit assignment.

Yingyu Shan, Yuhang Guo, Zihao Cheng +7

Reasoning & Chain-of-Thought RLHF & Preference Learning

Apr 13, 2026

Apr 13, 2026·also BIT, Edinburgh, Independent Researcher

Mem$^2$Evolve: Towards Self-Evolving Agents via Co-Evolutionary Capability Expansion and Experience Distillation

LLM agents can get 18% better at tasks by co-evolving their skills and tools, instead of learning them separately.

Zihao Cheng, Zeming Liu, Yingyu Shan +5

Reasoning & Chain-of-Thought Tool Use & Agents

Mar 4, 2026

Mar 4, 2026·also Oxford

LifeBench: A Benchmark for Long-Horizon Multi-Source Memory

Current AI memory systems are surprisingly bad at integrating diverse, real-world information across long time spans, as evidenced by a new benchmark where they only achieve 55% accuracy.

Zihao Cheng, Weixin Wang, Ziyang Ren +10

Eval Frameworks & Benchmarks Tool Use & Agents

Search

Zihao Cheng

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (3)