Wattlytics is introduced as a web-based platform for co-optimizing performance, energy consumption, and total cost of ownership (TCO) in HPC clusters. It integrates benchmark-driven GPU performance scaling with DVFS-aware power modeling and multi-year TCO analysis, enabling users to explore heterogeneous systems and deployment scenarios. The platform computes multidimensional decision metrics and supports design-space exploration, demonstrating that energy-efficient GPUs can outperform higher-performance alternatives under budget or energy constraints.
Stop blindly buying the fastest GPUs: Wattlytics reveals how energy-efficient GPUs can actually be more cost-effective for HPC under realistic budget and energy constraints.
The escalating computational demands and energy footprint of GPU-accelerated computing systems complicate informed design and operational decisions. We present the first release of Wattlytics (https://wattlytics.netlify.app), an interactive, browser-based decision-support system. Unlike existing procurement-oriented calculators, Wattlytics integrates benchmark-driven GPU performance scaling, dynamic voltage and frequency scaling (DVFS)-aware piecewise power modeling, and multi-year total cost of ownership (TCO) analysis within a single interactive environment. Users can configure heterogeneous systems across contemporary GPU architectures (GH200, H100, L40S, L40, A40, A100, and L4), select representative scientific workloads (e.g., GROMACS, AMBER), and explore deployment scenarios under constraints such as energy prices, system lifetime, and frequency scaling. Wattlytics computes multidimensional decision metrics (TCO breakdown, work-per-TCO, power-per-TCO, and work-per-watt-per-TCO) and supports design-space exploration, what-if scenarios, sensitivity metrics (elasticity, Sobol indices, Monte Carlo), and collaborative features to guide realistic cluster design and procurement under uncertainty. We demonstrate selected scenarios comparing deployment strategies under different operational modes: fixed budget, fixed GPU count, fixed performance, and fixed power. Our case studies show that, under budget or energy constraints, optimally deployed energy-efficient GPUs can outperform higher-performance alternatives in overall cost-effectiveness. Wattlytics helps users explore the design parameter space and distinguish between cost- and risk-driving factors, turning HPC design into a well-informed and explainable decision-making process.
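To make the fixed-budget comparison concrete, the following is a minimal sketch of a work-per-TCO calculation. It is not Wattlytics's actual model: it assumes TCO is only purchase price plus energy cost at constant power, and it ignores DVFS, cooling, and maintenance. The `GpuOption` class, the function names, and all prices, power draws, and throughputs are hypothetical values chosen for illustration, not benchmark data.

```python
from dataclasses import dataclass

HOURS_PER_YEAR = 365 * 24


@dataclass
class GpuOption:
    name: str           # hypothetical label, not a real product
    unit_price: float   # purchase price per GPU (made-up currency units)
    power_watts: float  # assumed constant draw under load (made up)
    throughput: float   # benchmark work per GPU-hour (arbitrary units, made up)


def tco_per_gpu(gpu: GpuOption, years: float, price_per_kwh: float) -> float:
    """Simplified TCO: capital cost plus energy cost over the lifetime."""
    energy_kwh = gpu.power_watts / 1000.0 * years * HOURS_PER_YEAR
    return gpu.unit_price + energy_kwh * price_per_kwh


def fixed_budget_plan(gpu: GpuOption, budget: float, years: float,
                      price_per_kwh: float) -> dict:
    """Size a deployment when the budget must cover the whole simplified TCO."""
    per_gpu = tco_per_gpu(gpu, years, price_per_kwh)
    count = int(budget // per_gpu)  # whole GPUs only
    total_tco = count * per_gpu
    total_work = count * gpu.throughput * years * HOURS_PER_YEAR
    work_per_tco = total_work / total_tco if total_tco else 0.0
    return {"count": count, "tco": total_tco,
            "work": total_work, "work_per_tco": work_per_tco}


# Two illustrative options: one fast and power-hungry, one slower but efficient.
fast = GpuOption("fast-gpu", unit_price=30_000, power_watts=700, throughput=100)
efficient = GpuOption("efficient-gpu", unit_price=8_000, power_watts=300, throughput=40)

plans = {
    gpu.name: fixed_budget_plan(gpu, budget=1_000_000, years=5, price_per_kwh=0.30)
    for gpu in (fast, efficient)
}

for name, plan in plans.items():
    print(f"{name}: {plan['count']} GPUs, work/TCO = {plan['work_per_tco']:.1f}")
```

With these invented inputs, the efficient option buys more GPUs within the same budget and delivers more total work. This shows only how such a comparison is set up, and the ranking can flip with different prices, lifetimes, or workloads.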