Lang Feng

OPID achieves a remarkable boost in agent performance by leveraging hierarchical skills extracted from on-policy trajectories, transforming sparse rewards into dense, actionable insights.

Shuo Yang, Zhengxi Lu, Lang Feng +2

RLHF & Preference Learning Tool Use & Agents

May 26, 2026

Shuo He +3May 26, 2026·also SEU

Beyond Trajectory-Level Attribution: Graph-Based Credit Assignment for Agentic Reinforcement Learning

Don't let valuable steps in failed trajectories go unnoticed: GraphGPO leverages state-transition graphs for fine-grained credit assignment in agentic RL, boosting performance and efficiency.

Shuo He, Lang Feng, Haiyang Xu +1

RLHF & Preference Learning Tool Use & Agents

May 14, 2026

B2∑iMay 14, 2026

EverAnimate: Minute-Scale Human Animation via Latent Flow Restoration

Generate minute-long, high-fidelity animations without visual degradation or character drift using a surprisingly simple latent flow restoration technique.

Wuyang Li, Yang Gao, Mariam Hassan +4

Computer Vision Multimodal Models

Apr 13, 2026

Apr 13, 2026·also B2∑i, Independent

Grounded World Model for Semantically Generalizable Planning

Visuomotor control can now generalize to unseen environments and instructions by grounding world models in a vision-language latent space, outperforming standard vision-language approaches by a large margin.

Quanyi Li, Quanyi Li, Lang Feng +5

Computer Vision Robotics & Embodied AI World Models & Planning

Feb 26, 2026

Feb 26, 2026·also NSFC, SEU

Hierarchy-of-Groups Policy Optimization for Long-Horizon Agentic Tasks

Context inconsistency in stepwise group-based RL can severely bias advantage estimation, but a hierarchical grouping strategy can fix it without extra compute.

Shuo He, Shuo He, Lang Feng +4

RLHF & Preference Learning Tool Use & Agents World Models & Planning

Search

Lang Feng

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (8)