P-GenRM personalizes LLMs more effectively by generating adaptive personas and scoring rubrics from user preferences, outperforming existing reward models by 2.31%, with a further 3% gain from test-time scaling.
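The paper's exact scoring pipeline isn't reproduced here; as a rough illustration of what test-time scaling for a generative reward model generally looks like, the sketch below samples several persona- and rubric-conditioned judgments and aggregates them. The `judge` callable and all parameter names are hypothetical stand-ins, not P-GenRM's API.

```python
import statistics
from typing import Callable

def score_with_tts(judge: Callable[..., float], persona: str, rubric: str,
                   response: str, n_samples: int = 8) -> float:
    """Test-time scaling for a generative reward model: sample several
    persona/rubric-conditioned judgments and aggregate their scores.

    `judge` is a hypothetical callable standing in for an LLM judge that
    returns one numeric score per sampled judgment.
    """
    scores = [judge(persona=persona, rubric=rubric, response=response)
              for _ in range(n_samples)]
    # Median aggregation is robust to occasional outlier judgments;
    # mean or majority vote are common alternatives.
    return statistics.median(scores)
```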
The rise of Direct Preference Optimization (DPO) as a computationally efficient alternative to RLHF for LLM alignment has spurred a diverse range of research, now systematically organized and analyzed in this comprehensive survey.
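For reference, the objective at the heart of the surveyed work is the standard DPO pairwise loss (Rafailov et al.), not anything specific to this survey. A minimal PyTorch sketch, with tensor names purely illustrative:

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps: torch.Tensor,
             policy_rejected_logps: torch.Tensor,
             ref_chosen_logps: torch.Tensor,
             ref_rejected_logps: torch.Tensor,
             beta: float = 0.1) -> torch.Tensor:
    """Standard DPO loss over sequence-level log-probabilities.

    Each input is a batch of summed log-probs for the chosen/rejected
    responses under the policy and the frozen reference model.
    """
    # Implicit rewards: beta * log(pi_theta / pi_ref) for each response
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # Maximize log-sigmoid of the chosen-vs-rejected reward margin
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()

# Example: a batch of 2 preference pairs (sequence-level log-probs)
pc = torch.tensor([-12.0, -9.5]); pr = torch.tensor([-14.0, -11.0])
rc = torch.tensor([-13.0, -10.0]); rr = torch.tensor([-13.5, -10.5])
print(dpo_loss(pc, pr, rc, rr))
```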