Search papers, labs, and topics across Lattice.
This paper introduces ESAA, an architecture for LLM-based autonomous agents that addresses limitations like state management and context degradation by separating agent intentions from project state mutations using the Event Sourcing pattern. ESAA ensures deterministic execution and traceability by having agents emit structured intentions in JSON, which are then validated, persisted in an append-only log, and used to project a verifiable materialized view. Case studies involving a landing page project and a clinical dashboard system, the latter using concurrent orchestration with heterogeneous LLMs, demonstrate ESAA's efficacy and scalability, achieving successful task completion and verification.
LLM agents can achieve deterministic execution and forensic traceability by decoupling cognitive intention from state mutation via an Event Sourcing architecture.
Autonomous agents based on Large Language Models (LLMs) have evolved from reactive assistants to systems capable of planning, executing actions via tools, and iterating over environment observations. However, they remain vulnerable to structural limitations: lack of native state, context degradation over long horizons, and the gap between probabilistic generation and deterministic execution requirements. This paper presents the ESAA (Event Sourcing for Autonomous Agents) architecture, which separates the agent's cognitive intention from the project's state mutation, inspired by the Event Sourcing pattern. In ESAA, agents emit only structured intentions in validated JSON (agent.result or issue.report); a deterministic orchestrator validates, persists events in an append-only log (activity.jsonl), applies file-writing effects, and projects a verifiable materialized view (roadmap.json). The proposal incorporates boundary contracts (AGENT_CONTRACT.yaml), metaprompting profiles (PARCER), and replay verification with hashing (esaa verify), ensuring the immutability of completed tasks and forensic traceability. Two case studies validate the architecture: (i) a landing page project (9 tasks, 49 events, single-agent composition) and (ii) a clinical dashboard system (50 tasks, 86 events, 4 concurrent agents across 8 phases), both concluding with run.status=success and verify_status=ok. The multi-agent case study demonstrates real concurrent orchestration with heterogeneous LLMs (Claude Sonnet 4.6, Codex GPT-5, Antigravity/Gemini 3 Pro, and Claude Opus 4.6), providing empirical evidence of the architecture's scalability beyond single-agent scenarios.