Search papers, labs, and topics across Lattice.
Beijing University of Posts and Telecommunications
2
0
5
LLM-based multi-agent systems are surprisingly vulnerable: a new RL-based attacker can evolve sophisticated, long-horizon attacks by exploiting trust in external tools.
TwinGate stops jailbreaks by tracking malicious intent across anonymized, interleaved queries with minimal overhead, something previous defenses couldn't do.