Beijing Institute of Technology
LLM agents get 18% better at tasks when their skills and tools co-evolve together, instead of being learned separately.
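A minimal sketch of the co-evolution idea, assuming a simple scored library of skills and tools; the `Skill`/`Tool`/`Agent` classes, the stand-in executor, and the score-update rule are illustrative assumptions, not the paper's method.

```python
import random
from dataclasses import dataclass, field


@dataclass
class Skill:
    name: str
    score: float = 0.0


@dataclass
class Tool:
    name: str
    score: float = 0.0


@dataclass
class Agent:
    skills: list = field(default_factory=list)
    tools: list = field(default_factory=list)

    def attempt(self, task: str) -> float:
        # Pick the current best skill and tool, run the task, then apply the
        # same reward to BOTH, so they improve as a compatible pair rather
        # than being optimized in isolation.
        skill = max(self.skills, key=lambda s: s.score)
        tool = max(self.tools, key=lambda t: t.score)
        reward = self._run(task, skill, tool)
        skill.score += reward
        tool.score += reward
        return reward

    def _run(self, task: str, skill: Skill, tool: Tool) -> float:
        # Stand-in for executing the task with an LLM; returns a random
        # reward purely to keep the sketch self-contained and runnable.
        return random.random()


agent = Agent(skills=[Skill("plan"), Skill("reflect")],
              tools=[Tool("search"), Tool("calculator")])
for _ in range(10):
    agent.attempt("example task")
```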
Forget monolithic policies: splitting your LLM's RL policy into an accuracy-focused mode and an exploration-driven mode unlocks both better performance and greater output diversity.
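A minimal sketch of a two-mode decoding policy, assuming the modes differ only in sampling temperature; the temperature values and the toy logits are illustrative assumptions, not the paper's formulation.

```python
import math
import random


def sample(logits: dict, temperature: float) -> str:
    # Temperature-scaled softmax sampling over a {token: logit} dict.
    scaled = {tok: l / temperature for tok, l in logits.items()}
    z = sum(math.exp(v) for v in scaled.values())
    r, acc = random.random(), 0.0
    for tok, v in scaled.items():
        acc += math.exp(v) / z
        if r <= acc:
            return tok
    return tok  # numerical edge case: return the last token


def generate(logits: dict, mode: str) -> str:
    # "accuracy" mode decodes conservatively (low temperature), while
    # "exploration" mode samples broadly to keep rollouts diverse.
    temperature = 0.2 if mode == "accuracy" else 1.2
    return sample(logits, temperature)


toy_logits = {"A": 2.0, "B": 1.0, "C": 0.1}
print(generate(toy_logits, "accuracy"), generate(toy_logits, "exploration"))
```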
Open-source 7B LLMs can now rival GPT-4o performance on validation tasks, thanks to a novel reinforcement learning approach that leverages calibrated self-evaluation as a dense reward signal.
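A minimal sketch of turning calibrated self-evaluation into a dense reward, assuming the model emits a per-step confidence in (0, 1); the temperature-scaling calibration used here is an illustrative assumption, not the paper's exact formulation.

```python
import math


def calibrate(p: float, temp: float = 1.5) -> float:
    # Temperature scaling in logit space: flattens over-confident self-scores.
    logit = math.log(p / (1.0 - p))
    return 1.0 / (1.0 + math.exp(-logit / temp))


def dense_rewards(step_confidences: list) -> list:
    # Each reasoning step gets its own reward from the calibrated self-score,
    # instead of a single sparse correctness signal at the end of the episode.
    return [calibrate(p) for p in step_confidences]


print(dense_rewards([0.9, 0.55, 0.99]))
```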