Search papers, labs, and topics across Lattice.
This paper investigates the ability of LLMs to perform feature model analysis operations (AOs) directly on semi-formal textual blueprints for Software Product Line scoping. They benchmarked 12 state-of-the-art LLMs against the solver-based oracle FLAMA using 16 standard AOs. Reasoning-optimized models like Grok 4 Fast Reasoning and Gemini 2.5 Pro achieved 88-89% average accuracy, suggesting LLMs can serve as lightweight assistants for early variability validation.
LLMs can now perform feature model analysis with near-solver accuracy directly from semi-formal blueprints, unlocking early validation in software product line scoping.
We study whether Large Language Models (LLMs) can perform feature model analysis operations (AOs) directly on semi-formal textual blueprints, i.e., concise constrained-language descriptions of feature hierarchies and constraints, enabling early validation in Software Product Line scoping. Using 12 state-of-the-art LLMs and 16 standard AOs, we compare their outputs against the solver-based oracle FLAMA. Results show that reasoning-optimized models (e.g., Grok 4 Fast Reasoning, Gemini 2.5 Pro) achieve 88-89% average accuracy across all evaluated blueprints and operations, approaching solver correctness. We identify systematic errors in structural parsing and constraint reasoning, and highlight accuracy-cost trade-offs that inform model selection. These findings position LLMs as lightweight assistants for early variability validation.