This work shows that robust preference alignment benefits from targeted interventions for each type of noise rather than from uniform regularization, and it proposes wDPO, a robust LLM alignment approach based on hierarchical winsorization.
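To make the term concrete, here is a minimal sketch of plain winsorization applied to per-example DPO losses. It is not the hierarchical scheme of wDPO, whose details this summary does not give. The function names, the quantile levels, the floor-index quantile rule, and the toy log-probabilities are all assumptions chosen for illustration.

```python
import math


def dpo_loss(policy_chosen, policy_rejected, ref_chosen, ref_rejected, beta=0.1):
    """Standard per-example DPO loss: -log(sigmoid(beta * margin))."""
    margin = (policy_chosen - ref_chosen) - (policy_rejected - ref_rejected)
    z = beta * margin
    # Numerically stable softplus(-z), which equals -log(sigmoid(z)).
    return max(-z, 0.0) + math.log1p(math.exp(-abs(z)))


def winsorize(values, lower_q=0.0, upper_q=0.9):
    """Clip values to empirical quantiles (floor-index rule, an assumption)."""
    ordered = sorted(values)
    n = len(ordered)
    lo = ordered[int(lower_q * (n - 1))]
    hi = ordered[int(upper_q * (n - 1))]
    # Values outside [lo, hi] are pulled to the nearest bound; order is kept.
    return [min(max(v, lo), hi) for v in values]


if __name__ == "__main__":
    # Toy sequence log-probabilities (hypothetical): the last pair mimics a
    # flipped label, where the policy strongly prefers the "rejected" answer.
    batch = [
        (-10.0, -14.0, -11.0, -13.0),
        (-9.0, -12.0, -10.0, -12.0),
        (-8.0, -11.0, -8.5, -10.5),
        (-7.0, -9.0, -7.5, -9.0),
        (-60.0, -5.0, -12.0, -12.0),
    ]
    losses = [dpo_loss(pc, pr, rc, rr) for pc, pr, rc, rr in batch]
    clipped = winsorize(losses, lower_q=0.0, upper_q=0.75)
    print("mean raw loss:       ", sum(losses) / len(losses))
    print("mean winsorized loss:", sum(clipped) / len(clipped))
```

Clipping the largest losses caps the influence a single suspicious pair can have on the batch average, which is the general motivation for winsorizing. How wDPO arranges this hierarchically and matches interventions to noise types is described in the paper itself.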