Search papers, labs, and topics across Lattice.
The paper introduces SWE-Adept, a two-agent LLM framework designed to improve performance on repository-level software engineering tasks by addressing challenges in codebase navigation and iterative code modification. SWE-Adept employs an agent-directed depth-first search for issue localization, minimizing irrelevant context and improving accuracy, and utilizes adaptive planning with specialized tools for version control and progress tracking for issue resolution. Experiments on SWE-Bench Lite and SWE-Bench Pro show SWE-Adept outperforms existing methods, increasing the end-to-end resolve rate by up to 4.7%.
LLMs can now navigate and modify large codebases more effectively using a two-agent system with depth-first search and Git-aware version control, boosting issue resolution rates by nearly 5%.
Large language models (LLMs) exhibit strong performance on self-contained programming tasks. However, they still struggle with repository-level software engineering (SWE), which demands (1) deep codebase navigation with effective context management for accurate localization, and (2) systematic approaches for iterative, test-driven code modification to resolve issues. To address these challenges, we propose SWE-Adept, an LLM-based two-agent framework where a localization agent identifies issue-relevant code locations and a resolution agent implements the corresponding fixes. For issue localization, we introduce agent-directed depth-first search that selectively traverses code dependencies. This minimizes issue-irrelevant content in the agent's context window and improves localization accuracy. For issue resolution, we employ adaptive planning and structured problem solving. We equip the agent with specialized tools for progress tracking and Git-based version control. These tools interface with a shared working memory that stores code-state checkpoints indexed by execution steps, facilitating precise checkpoint retrieval. This design enables reliable agent-driven version-control operations for systematic issue resolution, including branching to explore alternative solutions and reverting failed edits. Experiments on SWE-Bench Lite and SWE-Bench Pro demonstrate that SWE-Adept consistently outperforms prior approaches in both issue localization and resolution, improving the end-to-end resolve rate by up to 4.7%.