Search papers, labs, and topics across Lattice.
This paper introduces a closed-loop framework, \ours, designed to enhance memory optimization in software engineering agents by grounding memory utility in validated downstream impact. By establishing a task-agnostic evaluation benchmark and an optimization signal, \ours enables SE agents to retain and refine experiences across tasks, addressing the limitations of episodic memory. Experimental results show that \ours leads to significant improvements in success rates and resolve efficiency, while also reducing computational costs across various settings.
Closed-loop memory optimization can boost software engineering agents' success rates by over 5% while slashing computational costs by nearly 10%.
Large language models (LLMs) have enabled powerful software engineering (SE) agents capable of navigating complex codebases and resolving real-world issues. However, these agents remain fundamentally episodic: they fail to retain, refine, and reuse experiences across tasks, repeatedly reconstructing context from scratch and reproducing similar mistakes. Even with memory support, they offer no remedy for the absence of a principled, task-agnostic \textit{memory utility}, making them difficult to evaluate rigorously or generalize across agents and settings. To tackle these limitations, we introduce \ours, a closed-loop framework for memory augmentation in SE agents. \ours grounds memory utility in \textit{validated downstream impact}, establishing utility as both a task-agnostic \textbf{evaluation benchmark} and an annotation-free \textbf{optimization signal}. Through complementary evaluation on \textit{single-episode} and \textit{cross-episode} memory augmentation, results demonstrate that \ours consistently improves SE agents across settings, achieving absolute gains of up to $\uparrow5.25\%$ in success rate and $\uparrow4.63\%$ in resolve efficiency, while substantially reducing computational cost by $\geq9.79\%$. Our project page: \href{https://xhguo7.github.io/MemOp/}{https://xhguo7.github.io/MemOp/}.