Yaswanth Chittepu

University of Massachusetts Amherst

Papers on Lattice

Total citations

Topics

h-index

Research focus

RLHF & Preference Learning (1)

Frequent co-authors

Prasann Singhal (1)Greg Durrett (1)S. Niekum (1)

Papers (1)

Sep 26, 2025

Adaptive Margin RLHF via Preference over Preferences

Forget fixed margins in RLHF: modeling the *strength* of human preferences with "preference-over-preference" learning boosts both discriminative accuracy and generative quality.

Yaswanth Chittepu, Prasann Singhal, Greg Durrett +1

RLHF & Preference Learning

Search

Yaswanth Chittepu

Research focus

Frequent co-authors

Papers (1)