Fan Yang

China Academy of Space Technology, Beijing, China

Papers on Lattice

Total citations

Topics

h-index

Frequent co-authors

Haonan Song (1)Qingchen Xie (1)Huan Zhu (1)Feng Xiao (1)Luxi Xing (1)

Papers (1)

Jan 2, 2026

Haonan Song +14Jan 2, 2026·also China Academy of Space Technology, I, Shenzhen University of Advanced Technology

IRPM: Intergroup Relative Preference Modeling for Pointwise Generative Reward Models

Pointwise reward models can finally compete with pairwise models in RLHF, thanks to a new intergroup comparison method that scales linearly with the number of candidates.

Haonan Song, Qingchen Xie, Huan Zhu +12

Search

Fan Yang

Frequent co-authors

Papers (1)