Yifan Ding

Papers on Lattice

Total citations

Topics

h-index

Research focus

Eval Frameworks & Benchmarks (1)Red-Teaming & Adversarial Robustness (1)Tool Use & Agents (1)

Frequent co-authors

Yunhao Feng (1)Yingshui Tan (1)Yige Li (1)Yutao Wu (1)

Papers (1)

Apr 3, 2026

Yunhao Feng +6Apr 3, 2026·also UT Austin

AgentHazard: A Benchmark for Evaluating Harmful Behavior in Computer-Use Agents

Autonomous agents are alarmingly easy to trick into harmful behavior, even when using aligned models: Claude Code achieves a 73.63% success rate on the AgentHazard benchmark.

Yunhao Feng, Yifan Ding, Yingshui Tan +4

Eval Frameworks & Benchmarks Red-Teaming & Adversarial Robustness Tool Use & Agents

Search

Yifan Ding

Research focus

Frequent co-authors

Papers (1)