Junyi Li

Papers on Lattice

Total citations

Topics

h-index

Research focus

Constitutional AI & AI Ethics (1)Red-Teaming & Adversarial Robustness (1)RLHF & Preference Learning (1)

Frequent co-authors

Ziyi Chen (1)Peiran Yu (1)Heng Huang (1)

Papers (1)

Oct 7, 2025

Provably Mitigating Corruption, Overoptimization, and Verbosity Simultaneously in Offline and Online RLHF/DPO Alignment

Finally, a single algorithm, DPO-COV, tackles the trifecta of corrupted preferences, reward overoptimization, and verbosity that plague RLHF and DPO, and it even comes with theoretical guarantees.

Ziyi Chen, Junyi Li, Peiran Yu +1

Constitutional AI & AI Ethics Red-Teaming & Adversarial Robustness RLHF & Preference Learning

Search

Junyi Li

Research focus

Frequent co-authors

Papers (1)