Wenkai Yang

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

Training Efficiency & Optimization (3)Natural Language Processing (2)Inference & Quantization (2)Scalable Oversight & Alignment Theory (1)

Frequent co-authors

Shengda Fan (2)Yankai Lin (2)Jingwen Chen (1)Wenbo Nie (1)

Papers (4)

Jun 3, 2026

Rethinking Continual Experience Internalization for Self-Evolving LLM Agents

Multi-iteration experience learning in LLMs can lead to capability collapse, but strategic adjustments in experience granularity and injection patterns can stabilize and enhance performance.

Jingwen Chen, Wenkai Yang, Shengda Fan +7

Natural Language Processing Scalable Oversight & Alignment Theory Training Efficiency & Optimization

Apr 14, 2026

Tsinghua AIApr 14, 2026

Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe

OPD's "free lunch" of dense token-level reward may be an illusion, as teacher novelty, not just higher scores, drives successful distillation.

Yuxin Zuo, Yuxin Zuo, Jinqian Zhang +6

Inference & Quantization Natural Language Processing Training Efficiency & Optimization

Mar 15, 2026

Tsinghua AIMar 15, 2026·also BJTU, RUC

AgentProcessBench: Diagnosing Step-Level Process Quality in Tool-Using Agents

Tool-using agents may seem capable, but they struggle to distinguish neutral actions from errors, highlighting a critical need for better step-level process understanding.

Shengda Fan, Xuyan Ye, Yupeng Huo +9

Eval Frameworks & Benchmarks Reasoning & Chain-of-Thought Tool Use & Agents

Feb 12, 2026

Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation

Students can surpass their teachers in on-policy distillation by extrapolating rewards and merging knowledge from domain experts, challenging the conventional wisdom that students are inherently limited by their teachers' capabilities.

Wenkai Yang, Weijie Liu, Ruobing Xie +3

Inference & Quantization Training Efficiency & Optimization

Search

Wenkai Yang

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (4)