Search papers, labs, and topics across Lattice.
This paper investigates the integration of AI agent-authored pull requests (PRs) in GitHub using the AIDev dataset, focusing on factors influencing successful merging. Logistic regression analysis reveals that reviewer engagement strongly correlates with PR integration, while large change sizes and force pushes negatively impact merging. Qualitative analysis further indicates that successful integration hinges on agents participating in actionable review loops that align with reviewer expectations.
Human reviewer engagement, not just code quality, is the key to successfully merging AI-authored pull requests.
Autonomous coding agents increasingly contribute to software development by submitting pull requests on GitHub; yet, little is known about how these contributions integrate into human-driven review workflows. We present a large empirical study of agent-authored pull requests using the public AIDev dataset, examining integration outcomes, resolution speed, and review-time collaboration signals. Using logistic regression with repository-clustered standard errors, we find that reviewer engagement has the strongest correlation with successful integration, whereas larger change sizes and coordination-disrupting actions, such as force pushes, are associated with a lower likelihood of merging. In contrast, iteration intensity alone provides limited explanatory power once collaboration signals are considered. A qualitative analysis further shows that successful integration occurs when agents engage in actionable review loops that converge toward reviewer expectations. Overall, our results highlight that the effective integration of agent-authored pull requests depends not only on code quality but also on alignment with established review and coordination practices.