Yunhao Feng

Papers on Lattice

Total citations

Topics

h-index

Publication activity (papers/week, last 8 weeks)

Research focus

Tool Use & Agents (3), Constitutional AI & AI Ethics (2), Red-Teaming & Adversarial Robustness (2), Scalable Oversight & Alignment Theory (1)

Frequent co-authors

Yingshui Tan (2), Yige Li (2), Wenke Huang (2), Xiaohu Du (1)

Papers (3)

May 31, 2026

Yunhao Feng +16 · 1w ago

BraveGuard: From Open-World Threats to Safer Computer-Use Agents

Guard models trained with BraveGuard can detect safety threats in computer-use agents with over 82% accuracy, a significant leap from conventional methods.

Yunhao Feng, Xiaohu Du, Xinhao Deng +14

Constitutional AI & AI Ethics, Red-Teaming & Adversarial Robustness, Tool Use & Agents

May 26, 2026

2w ago · also Ant Group

Position: AI Safety Requires Effective Controllability

Alignment isn't enough: truly safe AI demands robust runtime controllability, which current methods often fail to provide.

Yige Li, Yunhao Feng, Jun Sun

Constitutional AI & AI Ethics, Scalable Oversight & Alignment Theory, Tool Use & Agents

Apr 8, 2026

Yunhao Feng +7 · Apr 8, 2026

SkillTrojan: Backdoor Attacks on Skill-Based Agent Systems

Skill-based agents, designed for modularity and scalability, are highly vulnerable: a single compromised skill can turn the entire system into a weapon.

Yunhao Feng, Yingshui Tan, Boren Zheng +5

Red-Teaming & Adversarial Robustness, Robotics & Embodied AI, Tool Use & Agents
