Search papers, labs, and topics across Lattice.
This paper investigates the ability of LLMs to perform architectural reasoning for networked systems design, finding that they often fail to satisfy critical constraints and encode incorrect assumptions. To address this, the authors introduce Kepler, a lightweight reasoning framework that combines structured, expert-driven specifications with SMT-based optimization to synthesize feasible designs. Experiments demonstrate that Kepler uncovers interactions missed by LLMs and supports systematic, explainable design exploration.
LLMs might sound good at designing networked systems, but they're surprisingly bad at avoiding configurations that violate basic constraints, highlighting the need for structured reasoning frameworks like Kepler.
Designing the architecture of modern networked systems requires navigating a large, combinatorial space of hardware, systems, and configuration choices with complex cross-layer interactions. Architects must balance competing objectives such as performance, cost, and deployability while satisfying compatibility and resource constraints, often relying on scattered rules-of-thumb drawn from benchmarks, papers, documentation, and expert experience. This raises a natural question: can large language models (LLMs) reliably perform this kind of architectural reasoning? We find that they cannot. While LLMs produce plausible configurations, they frequently miss critical constraints, encode incorrect assumptions, and exhibit ``stickiness'' to familiar patterns. A natural workaround--iterative validation via simulation or experimentation--is often prohibitively expensive at scale and, in many cases, infeasible, particularly when comparing hardware-dependent alternatives. Motivated by this gap, we present Kepler, a lightweight reasoning framework for architecture design that combines structured, expert-driven specifications with SMT-based optimization. Kepler encodes architecturally significant properties--requirements, incompatibilities, and qualitative trade-offs--about systems, hardware, and workloads as constraints, and synthesizes feasible designs that optimize user-defined objectives. It operates at an abstract level, capturing ``rules-of-thumb'' rather than detailed system behavior, enabling tractable reasoning while preserving key interactions, and provides explanations for its decisions. Through experiments and case studies, we show that Kepler uncovers interactions missed by LLMs and supports systematic, explainable design exploration.