LLM agents can have their proprietary skills stolen with just 3 interactions, exposing a major copyright vulnerability in the burgeoning skill marketplace.
LLM safety is a cat-and-mouse game: ORPO excels at breaking alignment, while DPO is best at restoring it, but at the cost of overall usefulness.