Search papers, labs, and topics across Lattice.
This paper explores the use of the Claude Code agent, powered by Claude Opus 4.6, to automatically generate a formal correctness proof for the administrative normal form (ANF) transformation within the CertiCoq verified compiler. The work demonstrates the potential of LLMs to automate complex formal verification tasks, significantly reducing the manual effort required for compiler development. The agent successfully produced a 7,800-line Rocq proof in approximately 96 hours, adapting techniques from a previously human-developed proof for the continuation-passing style (CPS) transformation.
LLMs can now tackle substantial formal verification tasks from scratch, slashing development time for verified compilers.
We report on using an agentic coding assistant (Claude Code, powered by Claude Opus 4.6) to mechanize a substantial Rocq correctness proof from scratch, with human guidance but without human proof writing. The proof establishes semantic preservation for the administrative normal form (ANF) transformation in the CertiCoq verified compiler for Rocq. The closely related continuation-passing style (CPS) transformation in CertiCoq was previously proved correct by human experts over several months. We use this proof as a template and instruct the LLM to adapt the proof technique to the ANF setting, which differs in important technical ways. The resulting ANF proof comprises approximately 7,800 lines of Rocq (larger than the 5,300-line CPS proof) and was developed in approximately 96 hours. We describe the proof technique and report on the experience of developing it with an LLM, discussing both the strengths and limitations of the approach and its implications for verified compiler construction.