Search papers, labs, and topics across Lattice.
1
0
3
4
LLM safety is a cat-and-mouse game: ORPO excels at breaking alignment, while DPO is best at restoring it, but at the cost of overall usefulness.