Zilong Zheng

Papers on Lattice

Total citations

Topics

h-index

Publication activitypapers/week, last 8 weeks

Research focus

RLHF & Preference Learning (1)Scalable Oversight & Alignment Theory (1)

Frequent co-authors

Xiaobo Wang (1)Tong Wu (1)Mingkong Tang (1)Jiaqi Li (1)

Papers (1)

May 29, 2026

3d ago

The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement

Forget expensive human preference data: this new method uses the policy's own value function to self-supervise reward model training, boosting performance across diverse benchmarks and RL algorithms.

Xiaobo Wang, Tong Wu, Mingkong Tang +3

RLHF & Preference Learning Scalable Oversight & Alignment Theory

Search

Zilong Zheng

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (1)