Xiangfeng Wang

Papers on Lattice

Total citations

Topics

h-index

Frequent co-authors

Yanlin Lai (1)Mitt Huang (1)Hangyu Guo (1)Haodong Li (1)Shaoxiong Zhan (1)

Papers (1)

Feb 6, 2026

Yanlin Lai +13Feb 6, 2026

R-Align: Enhancing Generative Reward Models through Rationale-Centric Meta-Judging

Even reward models that get the right answer can be dangerously wrong in their reasoning, leading to worse RLHF outcomes, but R-Align fixes this by explicitly aligning rationales with gold standard judgments.

Yanlin Lai, Mitt Huang, Hangyu Guo +11

Search

Xiangfeng Wang

Frequent co-authors

Papers (1)