Denghui Geng

Fudan University

Research focus

Recommendation & Information Retrieval (1), RLHF & Preference Learning (1)

Frequent co-authors

Hongru Hou (1), Tiehua Mei (1), Jinhui Huang (1), Ao Xu (1)

Papers (1)

May 27, 2026 · also School of Computer Science

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

Naive RL in recommender systems suffers from biased gradient estimates that favor longer paths; ProRL corrects this with a novel reward-centering and advantage-estimation scheme.

Hongru Hou, Tiehua Mei, Denghui Geng +4

Recommendation & Information Retrieval, RLHF & Preference Learning
