This paper introduces AgentRivet, an automated workflow that uses Large Language Models to generate missing Rivet routines from particle physics journal publications, addressing a significant coverage gap: only 39% of measurements have documented and publicly available routines. By extracting physics analysis information and applying a multi-step quality control process, AgentRivet produces competent routines with few syntax errors and reasonable physics fidelity. The paper also highlights physics-implementation issues, most of which arise from ambiguous definitions in publications, and notes that some models struggle with complex observables even when clear definitions are given.
AgentRivet fills a critical gap in particle physics analysis by automatically generating Rivet routines, improving the accessibility of model-independent measurements.
Particle physics collider experiments provide Rivet routines as part of the analysis preservation strategy for model-independent measurements. Rivet is a C++ toolkit that allows new theoretical models to be compared to the measurements, thus aiding the development and tuning of Monte Carlo event generators as well as searches for physics beyond the Standard Model. However, analysis coverage is known to be incomplete, with only 39% of measurements having documented and publicly available Rivet routines. In this article, we design and implement an automated workflow based on Large Language Models with the goal of providing the missing routines. This multi-step workflow, referred to as AgentRivet, extracts the physics analysis information from published papers and writes the missing Rivet routines, with intermediate code and physics reviews as part of an autonomous quality control. We report the results obtained using commercial Large Language Models, provided by OpenAI, Anthropic, and Google, for two recent measurements from the ATLAS and CMS experiments. We find that AgentRivet produces competent Rivet routines with few syntax errors. The physics fidelity of the routines is reasonable and follows the explanations given in the relevant publications. Nevertheless, physics-implementation issues do arise and are investigated using the artefacts produced by AgentRivet. The majority of physics-implementation issues arise from subtle-but-ambiguous definitions in the given publication, although some models struggle to implement complex observables even when clear definitions are given.
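To make the shape of such a multi-step workflow concrete, the following is a minimal Python sketch of an extract, write, and review loop. It is an illustration under stated assumptions, not the AgentRivet implementation: the function `run_workflow`, the `WorkflowResult` container, the prompt wording, the "reply OK" review convention, and the `max_rounds` limit are all hypothetical, and the language model is abstracted as any callable that maps a prompt string to a reply string.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Hypothetical abstraction: any function mapping a prompt to a text reply.
LLM = Callable[[str], str]


@dataclass
class WorkflowResult:
    routine: str  # final routine source text
    artefacts: List[str] = field(default_factory=list)  # intermediate outputs, kept for inspection


def run_workflow(paper_text: str, llm: LLM, max_rounds: int = 2) -> WorkflowResult:
    """Sketch of an extract -> write -> review loop (all prompts are assumptions)."""
    artefacts: List[str] = []

    # Step 1: extract the analysis definition from the publication text.
    summary = llm("EXTRACT the selection, observables and binning:\n" + paper_text)
    artefacts.append(summary)

    # Step 2: write a first draft of the routine from the extracted summary.
    routine = llm("WRITE a Rivet routine in C++ for:\n" + summary)
    artefacts.append(routine)

    # Step 3: alternate code and physics reviews with revisions.
    for _ in range(max_rounds):
        issues: List[str] = []
        for kind in ("CODE", "PHYSICS"):
            review = llm(
                f"{kind} REVIEW; reply OK if there are no issues.\n"
                f"Summary:\n{summary}\nRoutine:\n{routine}"
            )
            artefacts.append(review)
            if review.strip() != "OK":
                issues.append(review)
        if not issues:
            break  # both reviews passed, stop iterating
        routine = llm(
            "REVISE the routine to address:\n" + "\n".join(issues)
            + "\nRoutine:\n" + routine
        )
        artefacts.append(routine)

    return WorkflowResult(routine=routine, artefacts=artefacts)
```

Keeping every intermediate reply in `artefacts` mirrors the idea that physics-implementation issues can be traced afterwards through the outputs of each step; a real system would also compile the routine and compare its output with reference data, which this sketch omits.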