Zhenyu Wu

Decomposing GUI agent trajectories into verifiable milestones and auditing the evidence chain yields a 10% boost in RL training performance, outperforming single-judge reward systems.

Zhenyu Wu, Yibo Zhao, Yibo Zhao +19

RLHF & Preference Learning · Robotics & Embodied AI · Tool Use & Agents

Mar 16, 2026 · also CMU ML, Shanghai AI Lab, Xidian

RoCo Challenge at AAAI 2026: Benchmarking Robotic Collaborative Manipulation for Assembly Towards Industrial Automation

Strategic recovery from failures is key to deploying robots for complex assembly tasks in the real world.

Haichao Liu, Yuheng Zhou, Zhenyu Wu +17

Eval Frameworks & Benchmarks · Robotics & Embodied AI · World Models & Planning

Mar 11, 2026 · also IIT Bombay, Shanghai AI Lab, Unitree

SteadyTray: Learning Object Balancing Tasks in Humanoid Tray Transport via Residual Reinforcement Learning

Humanoid robots can now reliably transport objects on a tray in the real world, thanks to a hierarchical RL approach that isolates and cancels gait-induced disturbances.

Anlun Huang, Zhenyu Wu, Simranjeet Singh +2

Robotics & Embodied AI · Training Efficiency & Optimization

Papers (4)