Xingzhi Yao

JD.com

Papers on Lattice

Total citations

Topics

Publication activitypapers/week, last 8 weeks

Research focus

Recommendation & Information Retrieval (1)RLHF & Preference Learning (1)

Frequent co-authors

Kewei Xu (1)Junbo Qi (1)Yanyan Zou (1)Pengfei Zhang (1)

Papers (1)

Jun 7, 2026

1w ago·also UESTC, Waseda

Adaptive Loss Balancing for Noise-Robust GRPO in Generative Recommendation

Reward-guided optimization can be selectively applied to enhance generative recommendation performance, avoiding the pitfalls of uniform reinforcement learning.

Kewei Xu, Junbo Qi, Yanyan Zou +3

Recommendation & Information Retrieval RLHF & Preference Learning

Search

Xingzhi Yao

Publication activitypapers/week, last 8 weeks

Research focus

Frequent co-authors

Papers (1)