Yibin Liu

Lequn Fu, Yijun Zhong, Xiao Li, and Yibin Liu are with the Huazhong University of Science and Technology, Wuhan 430074, China (e-mail: fulq@hust.edu.cn; zhongyijun@hust.edu.cn; li_xiao@hust.edu.cn; liu_yibin@hust.edu.cn). Zhiyuan Xu and Jian Tang are with the Beijing Humanoid Robot Innovation Center, Beijing 100086, China (e-mail: eric.xu@x-humanoid.com; jian.tang@x-humanoid.com). Shiqi Li is with the Huazhong University of Science and Technology, Wuhan 430074, China (e-mail: sqli@hust.edu.cn).

Research focus

Multimodal Models (1), Reasoning & Chain-of-Thought (1), RLHF & Preference Learning (1), Robotics & Embodied AI (1)

Frequent co-authors

Yaxing Lyu (1), Daqi Gao (1), Daqiang Gao (1), Zhixuan Liang (1)

Papers (1)

Mar 16, 2026 · also HUST, ZJU

From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation

A 7B model trained with RL can outperform 72B-scale general MLLMs in robotic manipulation process supervision by explicitly reasoning about progress toward the final task goal.

Yibin Liu, Yaxing Lyu, Daqi Gao +6

Multimodal Models, Reasoning & Chain-of-Thought, RLHF & Preference Learning, +1
