Search papers, labs, and topics across Lattice.
The paper introduces AI-CARE, a tool for evaluating and reporting the energy consumption and carbon emissions of ML models, addressing the lack of carbon-awareness in existing benchmarks. It proposes a carbon-performance tradeoff curve to visualize the Pareto frontier between model performance and carbon cost. Empirical validation on ML workloads demonstrates that carbon-aware benchmarking alters model rankings, favoring architectures that balance accuracy and environmental impact.
Carbon-aware benchmarks flip the script on model rankings, revealing that top-performing AI aren't always the greenest.
As machine learning (ML) continues its rapid expansion, the environmental cost of model training and inference has become a critical societal concern. Existing benchmarks overwhelmingly focus on standard performance metrics such as accuracy, BLEU, or mAP, while largely ignoring energy consumption and carbon emissions. This single-objective evaluation paradigm is increasingly misaligned with the practical requirements of large-scale deployment, particularly in energy-constrained environments such as mobile devices, developing regions, and climate-aware enterprises. In this paper, we propose AI-CARE, an evaluation tool for reporting energy consumption, and carbon emissions of ML models. In addition, we introduce the carbon-performance tradeoff curve, an interpretable tool that visualizes the Pareto frontier between performance and carbon cost. We demonstrate, through theoretical analysis and empirical validation on representative ML workloads, that carbon-aware benchmarking changes the relative ranking of models and encourages architectures that are simultaneously accurate and environmentally responsible. Our proposal aims to shift the research community toward transparent, multi-objective evaluation and align ML progress with global sustainability goals. The tool and documentation are available at https://github.com/USD-AI-ResearchLab/ai-care.