Yuandong Tan

Papers on Lattice

Total citations

Topics

h-index

Research focus

RLHF & Preference Learning (1)Scalable Oversight & Alignment Theory (1)Training Efficiency & Optimization (1)

Papers (1)

Aug 10, 2025

Yuandong TanAug 10, 2025

A Principled Loss Function for Direct Language Model Alignment

DPO's instability problem is solved with a new loss function that directly targets a finite logits difference, leading to better alignment and preventing reward hacking.

Yuandong Tan

RLHF & Preference Learning Scalable Oversight & Alignment Theory Training Efficiency & Optimization

Search

Yuandong Tan

Research focus

Papers (1)