Hengrui Chen

Fudan University

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

RLHF & Preference Learning (2)Recommendation & Information Retrieval (1)Reasoning & Chain-of-Thought (1)Tool Use & Agents (1)

Frequent co-authors

Hongru Hou (2)Tiehua Mei (2)Ao Xu (2)Denghui Geng (1)

Papers (2)

May 27, 2026

May 27, 2026·also School of Computer Science

ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation

Naive RL in recommender systems suffers from biased gradients that favor longer paths, but ProRL fixes this with a novel reward centering and advantage estimation scheme.

Hongru Hou, Tiehua Mei, Denghui Geng +4

Recommendation & Information Retrieval RLHF & Preference Learning

Mar 10, 2026

Good Reasoning Makes Good Demonstrations: Implicit Reasoning Quality Supervision via In-Context Reinforcement Learning

Stop training LLMs on lucky guesses: this new RL method uses the model's own in-context learning ability to identify and upweight high-quality reasoning traces, leading to better performance.

Tiehua Mei, Leiyu Pan, Zhenpeng Su +3

Reasoning & Chain-of-Thought RLHF & Preference Learning Tool Use & Agents

Search

Hengrui Chen

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (2)