AgentWard is a security architecture designed to protect autonomous AI agents by organizing defenses across five key lifecycle stages: initialization, input processing, memory, decision-making, and execution. It integrates stage-specific security controls with cross-layer coordination to intercept threats as they propagate through the system. A prototype implementation on OpenClaw demonstrates the practical feasibility of AgentWard as a blueprint for structuring runtime security controls and managing trust propagation.
Securing autonomous AI agents demands a lifecycle-oriented approach, and AgentWard provides a blueprint for defense-in-depth across initialization, input processing, memory, decision-making, and execution.
Autonomous AI agents extend large language models into full runtime systems that load skills, ingest external content, maintain memory, plan multi-step actions, and invoke privileged tools. In such systems, security failures rarely remain confined to a single interface; instead, they can propagate across initialization, input processing, memory, decision-making, and execution, often becoming apparent only when harmful effects materialize in the environment. This paper presents AgentWard, a lifecycle-oriented, defense-in-depth architecture that systematically organizes protection across these five stages. AgentWard integrates stage-specific, heterogeneous controls with cross-layer coordination, enabling threats to be intercepted along their propagation paths while safeguarding critical assets. We detail the design rationale and architecture of five coordinated protection layers, and implement a plugin-native prototype on OpenClaw to demonstrate practical feasibility. This perspective provides a concrete blueprint for structuring runtime security controls, managing trust propagation, and enforcing execution containment in autonomous AI agents. Our code is available at https://github.com/FIND-Lab/AgentWard.
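The abstract's core idea, stage-specific guards coordinated through shared trust state, can be sketched as a minimal pipeline. All names below (`Context`, the `*_guard` functions, the trust thresholds) are illustrative assumptions for exposition, not AgentWard's actual API: each lifecycle stage can veto an action outright or degrade a trust score that later stages consult, modeling cross-layer trust propagation.

```python
from dataclasses import dataclass, field

@dataclass
class Context:
    """Cross-layer state shared by all stage guards (hypothetical)."""
    trust: float = 1.0          # 1.0 = fully trusted; checks may degrade it
    log: list = field(default_factory=list)

def init_guard(payload, ctx):
    # Initialization: only load skills from a vetted allowlist.
    allowed = {"search", "summarize"}
    if payload.get("skill") not in allowed:
        raise PermissionError("unvetted skill")
    return payload

def input_guard(payload, ctx):
    # Input processing: external content degrades trust rather than blocking.
    if payload.get("source") == "external":
        ctx.trust *= 0.5
        ctx.log.append("external input: trust degraded")
    return payload

def memory_guard(payload, ctx):
    # Memory: tag stored items with the trust level they were written at.
    payload["mem_trust"] = ctx.trust
    return payload

def decision_guard(payload, ctx):
    # Decision-making: privileged plans require a high-trust context.
    if payload.get("privileged") and ctx.trust < 0.8:
        raise PermissionError("low-trust context may not plan privileged actions")
    return payload

def execution_guard(payload, ctx):
    # Execution: final containment checkpoint before tool invocation.
    ctx.log.append(f"execute {payload['skill']} at trust {ctx.trust}")
    return payload

STAGES = [init_guard, input_guard, memory_guard, decision_guard, execution_guard]

def run(payload):
    """Push one action through all five lifecycle stages in order."""
    ctx = Context()
    for stage in STAGES:
        payload = stage(payload, ctx)
    return payload, ctx
```

Under these assumptions, an unprivileged action sourced from external content passes through at degraded trust (0.5), while the same context attempting a privileged action is rejected at the decision stage, illustrating how a threat introduced at input processing is intercepted along its propagation path rather than at the point of entry.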