Search papers, labs, and topics across Lattice.
2
0
4
2
Skill-based agents, designed for modularity and scalability, are shockingly vulnerable: a single compromised skill can turn the entire system into a weapon.
Autonomous agents are alarmingly easy to trick into harmful behavior, even when using aligned models: Claude Code achieves a 73.63% success rate on the AgentHazard benchmark.