The paper introduces SpecMind, a framework for postcondition inference that uses LLMs in an interactive, multi-turn fashion instead of single-pass prompting. SpecMind iteratively refines candidate postconditions using both implicit and explicit correctness feedback, and lets the model decide autonomously when to stop refining. Experiments demonstrate that SpecMind significantly outperforms state-of-the-art approaches in both accuracy and completeness of generated postconditions.
LLMs can generate more accurate and complete postconditions by engaging in iterative refinement guided by correctness feedback, rather than relying on single-pass prompting.
Specifications are vital for ensuring program correctness, yet writing them manually remains challenging and time-intensive. Recent large language model (LLM)-based methods have shown success in generating specifications such as postconditions, but existing single-pass prompting often yields inaccurate results. In this paper, we present SpecMind, a novel framework for postcondition generation that treats LLMs as interactive and exploratory reasoners rather than one-shot generators. SpecMind employs feedback-driven multi-turn prompting, enabling the model to iteratively refine candidate postconditions by incorporating implicit and explicit correctness feedback while autonomously deciding when to stop. This process fosters deeper code comprehension and, through exploratory attempts, improves alignment with true program behavior. Our empirical evaluation shows that SpecMind significantly outperforms state-of-the-art approaches in both accuracy and completeness of generated postconditions.
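The feedback-driven loop described in the abstract can be illustrated with a minimal Python sketch. This is not SpecMind's implementation: the abstract does not give the algorithm, so every name here (`check`, `refine`, `scripted_proposer`, `CANDIDATES`) is hypothetical. The LLM is replaced by a scripted proposer, explicit feedback is modeled as a counterexample found by running the function under test, and "deciding when to stop" is modeled as the proposer returning `None`.

```python
from typing import Callable, List, Optional, Tuple

# Assumption: a postcondition is a named predicate over (input, result).
Postcondition = Callable[[int, int], bool]
Candidate = Tuple[str, Postcondition]


def check(post: Postcondition, fn: Callable[[int], int],
          inputs: List[int]) -> Optional[int]:
    """Return the first input whose real output violates the candidate, else None."""
    for x in inputs:
        if not post(x, fn(x)):
            return x
    return None


def refine(propose: Callable[[Optional[int], int], Optional[Candidate]],
           fn: Callable[[int], int], inputs: List[int], max_turns: int = 5):
    """Multi-turn loop: propose, check against real behavior, feed the result back."""
    feedback: Optional[int] = None
    history = []
    for turn in range(max_turns):
        candidate = propose(feedback, turn)
        if candidate is None:  # the proposer chooses to stop
            break
        name, post = candidate
        counterexample = check(post, fn, inputs)
        history.append((name, counterexample))
        if counterexample is None:  # candidate agrees with every observed run
            return name, history
        feedback = counterexample  # explicit feedback for the next turn
    return None, history


# Hypothetical stand-in for the model: a fixed script of attempts for abs().
CANDIDATES: List[Candidate] = [
    ("result > 0", lambda x, r: r > 0),
    ("result >= 0 and result in (x, -x)", lambda x, r: r >= 0 and r in (x, -x)),
]


def scripted_proposer(feedback: Optional[int], turn: int) -> Optional[Candidate]:
    # A real model would condition on the feedback; this script ignores it.
    return CANDIDATES[turn] if turn < len(CANDIDATES) else None


accepted, history = refine(scripted_proposer, abs, [-2, 0, 3])
print(accepted)
print(history)
```

In this run the first candidate is rejected because `abs(0)` is not strictly positive, and the second is accepted. The check only tests whether a candidate holds on observed executions; measuring completeness, which the abstract also reports, would need an additional check that this sketch omits.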