Search papers, labs, and topics across Lattice.
1
0
3
Language models harbor hidden "PII leakage knobs" – universal activation directions that, when tweaked, dramatically increase the generation of sensitive personal information.