Nicholas Meade

Papers on Lattice

Total citations

Topics

h-index

Research focus

Eval Frameworks & Benchmarks (1)Red-Teaming & Adversarial Robustness (1)Tool Use & Agents (1)

Frequent co-authors

Xuwei Ding (1)Skylar Zhai (1)Jiate Li (1)Taiwei Shi (1)

Papers (1)

Apr 12, 2026

Xuwei Ding +7Apr 12, 2026·also USC

The Blind Spot of Agent Safety: How Benign User Instructions Expose Critical Vulnerabilities in Computer-Use Agents

Even safety-aligned agents like Claude 4.5 Sonnet can be tricked into harmful actions with over 90% success rate simply through benign user instructions within specific task contexts, revealing a major blind spot in current safety evaluations.

Xuwei Ding, Skylar Zhai, Jiate Li +5

Eval Frameworks & Benchmarks Red-Teaming & Adversarial Robustness Tool Use & Agents

Search

Nicholas Meade

Research focus

Frequent co-authors

Papers (1)