GenePlan uses LLM-assisted evolutionary algorithms to generate domain-dependent generalized planners from PDDL descriptions. It evolves interpretable Python planners to minimize plan length across diverse problem instances, effectively framing generalized planning as an optimization problem. GenePlan achieves a 0.91 average SAT score across eight domains, closely matching state-of-the-art planners while significantly outperforming LLM-based baselines like chain-of-thought prompting.
LLMs can evolve interpretable Python planners that come close to state-of-the-art classical planners (average SAT score 0.91 vs. 0.93), and the resulting planners solve new instances quickly and at low cost.
We present GenePlan (GENeralized Evolutionary Planner), a novel framework that leverages large language model (LLM)-assisted evolutionary algorithms to generate domain-dependent generalized planners for classical planning tasks described in PDDL. By casting generalized planning as an optimization problem, GenePlan iteratively evolves interpretable Python planners that minimize plan length across diverse problem instances. In an empirical evaluation across six existing benchmark domains and two new domains, GenePlan achieved an average SAT score of 0.91, closely matching state-of-the-art planners (SAT score 0.93) and significantly outperforming other LLM-based baselines such as chain-of-thought (CoT) prompting (average SAT score 0.64). The generated planners solve new instances rapidly (average 0.49 seconds per task) and at low cost (average $1.82 per domain using GPT-4o).
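The idea of treating generalized planning as an optimization problem can be illustrated with a minimal sketch. This is not GenePlan's implementation: the toy domain (reach a target from 0 with "+1" and "+5" actions), the `Candidate` class, the failure penalty, and the `propose_variant` function are all hypothetical. In particular, `propose_variant` is a random stand-in for the step where an LLM would rewrite a planner's Python source. The sketch only shows the shape of the loop: score each candidate planner by mean plan length over a set of training instances, generate variants, and keep the best.

```python
import random
from dataclasses import dataclass
from typing import List, Optional

BIG = 5              # toy domain: the large action adds 5, the small one adds 1
FAIL_PENALTY = 1000  # assumed cost charged when a planner fails an instance


@dataclass(frozen=True)
class Candidate:
    """Hypothetical planner: take "+5" while the remaining distance >= threshold."""
    threshold: int


def run_planner(c: Candidate, target: int) -> Optional[List[str]]:
    """Run the candidate on one instance; return a plan, or None on overshoot."""
    plan, pos = [], 0
    while pos < target:
        if target - pos >= c.threshold:
            pos += BIG
            plan.append("+5")
        else:
            pos += 1
            plan.append("+1")
    return plan if pos == target else None


def fitness(c: Candidate, instances: List[int]) -> float:
    """Mean plan length over the instances (lower is better)."""
    total = 0
    for target in instances:
        plan = run_planner(c, target)
        total += len(plan) if plan is not None else FAIL_PENALTY
    return total / len(instances)


def propose_variant(parent: Candidate, rng: random.Random) -> Candidate:
    """Stand-in for the LLM mutation step: nudge the threshold by one."""
    return Candidate(parent.threshold + rng.choice([-1, 1]))


def evolve(population: List[Candidate], instances: List[int],
           generations: int = 30, seed: int = 0) -> Candidate:
    """Elitist loop: merge parents and children, keep the fittest candidates."""
    rng = random.Random(seed)
    size = len(population)
    for _ in range(generations):
        children = [propose_variant(rng.choice(population), rng)
                    for _ in range(size)]
        merged = set(population + children)
        population = sorted(merged, key=lambda c: fitness(c, instances))[:size]
    return population[0]


if __name__ == "__main__":
    train = [4, 7, 10, 12, 16, 23]
    best = evolve([Candidate(9), Candidate(12)], train)
    print(best, fitness(best, train))
```

Because parents compete with their children in every generation, the best fitness in the population never gets worse. The evolved artifact is a planner that is reused across instances, not a single plan, which is what lets new instances be solved without further search over candidates.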