Search papers, labs, and topics across Lattice.
RepoLaunch is introduced as an LLM-powered agent designed to automate the build and test pipeline for software repositories, regardless of programming language or operating system. It achieves this by automatically resolving dependencies, compiling code, and extracting test results. The system facilitates the creation of large-scale software engineering datasets with minimal human intervention, enabling scalable benchmarking and training of coding agents and LLMs.
Automating software repository build and testing across languages and platforms is now possible, unlocking scalable benchmarking and training for coding agents.
Building software repositories typically requires significant manual effort. Recent advances in large language model (LLM) agents have accelerated automation in software engineering (SWE). We introduce RepoLaunch, the first agent capable of automatically resolving dependencies, compiling source code, and extracting test results for repositories across arbitrary programming languages and operating systems. To demonstrate its utility, we further propose a fully automated pipeline for SWE dataset creation, where task design is the only human intervention. RepoLaunch automates the remaining steps, enabling scalable benchmarking and training of coding agents and LLMs. Notably, several works on agentic benchmarking and training have recently adopted RepoLaunch for automated task generation.