Ruipeng Jia

Qwen Large Model Application Team, Alibaba

Research focus

Interpretability & Mechanistic Interp (1)
RLHF & Preference Learning (1)
Scalable Oversight & Alignment Theory (1)

Frequent co-authors

Yunyi Yang (1)
Yuxin Wu (1)
Siyuan Tao (1)
Jianhe Lin (1)

Papers (1)

DAMO · Feb 15, 2026 · also CAS

Open Rubric System: Scaling Reinforcement Learning with Pairwise Adaptive Rubric

Ditch the black-box reward function: this new rubric-based RL framework uses LLMs to judge responses against interpretable criteria, offering a more robust and transparent approach to alignment.

Ruipeng Jia, Yunyi Yang, Yuxin Wu +2

Interpretability & Mechanistic Interp, RLHF & Preference Learning, Scalable Oversight & Alignment Theory
