Correcting for suboptimal behavior during preference learning unlocks substantial gains in offline RLHF and improves online performance in continuous control tasks.