Search papers, labs, and topics across Lattice.
Shanghai Jiao Tong University
2
0
6
Shuffling activations, a popular defense in secure Transformer inference, crumbles under a new alignment attack that recovers model weights for just $1.
LLM agents acting in the real world introduce a whole new threat landscape beyond unsafe text, demanding a shift in focus towards system-level security for agent ecosystems.