STRUCTUREDAGENT introduces a hierarchical planning framework for web agents that uses dynamic AND/OR trees for efficient search and a structured memory module to track candidate solutions. This approach addresses limitations of existing web agents in long-horizon tasks, such as limited memory and weak planning. Experiments on WebVoyager, WebArena, and custom shopping benchmarks demonstrate improved performance compared to standard LLM-based agents.
A hierarchical planner built on dynamic AND/OR trees, combined with a structured memory module, helps web agents handle complex, long-horizon tasks and outperforms standard LLM-based agents on WebVoyager, WebArena, and custom shopping benchmarks.
Recent advances in large language models (LLMs) have enabled agentic systems for sequential decision-making. Such agents must perceive their environment, reason across multiple time steps, and take actions that optimize long-term objectives. However, existing web agents struggle on complex, long-horizon tasks due to limited in-context memory for tracking history, weak planning abilities, and greedy behaviors that lead to premature termination. To address these challenges, we propose STRUCTUREDAGENT, a hierarchical planning framework with two core components: (1) an online hierarchical planner that uses dynamic AND/OR trees for efficient search and (2) a structured memory module that tracks and maintains candidate solutions to improve constraint satisfaction in information-seeking tasks. The framework also produces interpretable hierarchical plans, enabling easier debugging and facilitating human intervention when needed. Our results on WebVoyager, WebArena, and custom shopping benchmarks show that STRUCTUREDAGENT improves performance on long-horizon web-browsing tasks compared to standard LLM-based agents.
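To make the two components more concrete, the following is a minimal sketch of how an AND/OR plan tree and a candidate-tracking memory could fit together. It is not the STRUCTUREDAGENT implementation: the names `Node`, `CandidateMemory`, `solve`, and `run_demo`, the hard-coded shopping task, and the static tree are all assumptions made for illustration. In particular, the abstract describes trees that are built dynamically online, whereas this sketch fixes the tree up front and stubs out every browser action.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional


@dataclass
class Node:
    """One subgoal in a hypothetical AND/OR plan tree."""
    goal: str
    kind: str = "LEAF"  # "AND", "OR", or "LEAF"
    children: List["Node"] = field(default_factory=list)
    action: Optional[Callable[[], bool]] = None  # stubbed action, leaves only


@dataclass
class CandidateMemory:
    """Hypothetical structured memory: candidates plus named constraints."""
    constraints: Dict[str, Callable[[dict], bool]]
    candidates: List[dict] = field(default_factory=list)

    def add(self, item: dict) -> None:
        self.candidates.append(item)

    def satisfying(self) -> List[dict]:
        # Keep only candidates that pass every constraint check.
        return [c for c in self.candidates
                if all(check(c) for check in self.constraints.values())]


def solve(node: Node, trace: List[str]) -> bool:
    """Depth-first evaluation: AND needs all children, OR needs any one."""
    if node.kind == "LEAF":
        ok = node.action() if node.action else False
        trace.append(f"{node.goal}: {'ok' if ok else 'fail'}")
        return ok
    if node.kind == "AND":
        return all(solve(child, trace) for child in node.children)
    if node.kind == "OR":
        return any(solve(child, trace) for child in node.children)
    raise ValueError(f"unknown node kind: {node.kind}")


def run_demo():
    # Assumed task: find a laptop costing at most 1000 with rating >= 4.0.
    memory = CandidateMemory(constraints={
        "price": lambda c: c["price"] <= 1000,
        "rating": lambda c: c["rating"] >= 4.0,
    })

    def browse_category() -> bool:
        # Stub for a browsing step that records what it finds in memory.
        memory.add({"name": "A", "price": 1200, "rating": 4.5})
        memory.add({"name": "B", "price": 900, "rating": 4.2})
        return True

    plan = Node("buy a laptop", "AND", [
        Node("find listings", "OR", [
            Node("use site search", action=lambda: False),  # simulated failure
            Node("browse category", action=browse_category),
        ]),
        Node("pick a candidate meeting all constraints",
             action=lambda: bool(memory.satisfying())),
    ])

    trace: List[str] = []
    ok = solve(plan, trace)
    return ok, trace, memory.satisfying()


if __name__ == "__main__":
    ok, trace, matches = run_demo()
    print(ok, trace, matches)
```

In this sketch the OR node lets the agent fall back to an alternative strategy when one fails, rather than terminating early, and the AND node only succeeds once the memory holds a candidate that passes every constraint. The recorded trace is a rough stand-in for the interpretable plan the abstract mentions.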