Search papers, labs, and topics across Lattice.
The paper introduces EviACT, a novel agentic program repair framework that leverages execution evidence to guide localization, patch generation, and validation. EviACT employs three evidence-driven guardrails: a retrieval scaffold for grounding repair context, a compile gate for filtering invalid edits, and a test-driven gate for checking target-test recovery. Results across four benchmarks demonstrate that EviACT improves resolve rates by 1.6-6.0 percentage points compared to strong baselines, while also reducing API costs by 70.1-88.6%.
Triaging LLM-based code repair with execution evidence slashes API costs by up to 88% while boosting repair rates.
LLM-based agents have moved automated program repair (APR) from fixed-context patch generation to interactive repository-level repair. However, existing agentic APR systems still struggle to use execution evidence to guide localization, patch generation, and validation. We propose EviACT (Evidence-to-Action), an agentic APR framework that coordinates three evidence-driven guardrails across repair stages. The retrieval scaffold grounds repair context, the compile gate filters invalid edits, and the test-driven gate checks target-test recovery before full regression. Across four benchmarks, EviACT improves resolve rate over the strongest reported comparable baselines by 1.6-6.0 percentage points and shows 70.1-88.6% lower reported per-bug API cost where baseline costs are available. Ablations and diagnostics suggest that these gains are associated with the coordinated evidence-to-action chain, making agentic APR more effective and efficient.