Kai Qin

Papers on Lattice

Total citations

Topics

Research focus

Interpretability & Mechanistic Interp (1)RLHF & Preference Learning (1)

Frequent co-authors

Liangxin Liu (1)Yu Liang (1)Longzheng Wang (1)Yueyang Zhang (1)

Papers (1)

Apr 8, 2026

Tsinghua AIApr 8, 2026·also Baidu, Xiamen University

ReflectRM: Boosting Generative Reward Models via Self-Reflection within a Unified Judgment Framework

By reflecting on its own reasoning, ReflectRM achieves a +10.2 improvement in mitigating positional bias compared to leading generative reward models, making it a far more stable evaluator.

Kai Qin, Liangxin Liu, Yu Liang +4

Interpretability & Mechanistic Interp RLHF & Preference Learning

Search

Kai Qin

Research focus

Frequent co-authors

Papers (1)