Search papers, labs, and topics across Lattice.
3
0
5
0
Guard models trained with BraveGuard can detect safety threats in computer-use agents with over 82% accuracy, a significant leap from conventional methods.
Alignment isn't enough: truly safe AI demands robust runtime controllability, which current methods often fail to provide.
Skill-based agents, designed for modularity and scalability, are shockingly vulnerable: a single compromised skill can turn the entire system into a weapon.