Search papers, labs, and topics across Lattice.
1
0
4
15
By prioritizing gradient direction in token optimization and using a two-stage loss, TAO-Attack achieves near-perfect jailbreak success rates against multiple LLMs, exposing critical vulnerabilities in current safety alignments.