Yatong Bai

Papers on Lattice

Total citations

Topics

h-index

Research focus

Computer Vision (1)RLHF & Preference Learning (1)

Frequent co-authors

Jonah Casebeer (1)S. Sojoudi (1)Nicholas J. Bryan (1)

Papers (1)

Apr 21, 2025

Yatong Bai +3Apr 21, 2025

DRAGON: Distributional Rewards Optimize Diffusion Generative Models

Forget RLHF and DPO – DRAGON lets you fine-tune generative models with rewards that compare entire *distributions* of outputs, unlocking better control and quality without human preference data.

Yatong Bai, Jonah Casebeer, S. Sojoudi +1

Computer Vision RLHF & Preference Learning

Search

Yatong Bai

Research focus

Frequent co-authors

Papers (1)