Single-shot jailbreak detection misses a shocking amount of harmful LLM behavior, meaning current safety evaluations are likely overoptimistic.