Search papers, labs, and topics across Lattice.
1
0
2
Optimism is the key to stable and convergent safe RLHF, according to a new primal-dual framework that unifies existing alignment algorithms.