Brandenburgische Technische Universität Cottbus, Shahid Bahonar University, Tallinn University of Technology, University of Zanjan

Feb 18, 2026

arXiv:2602.16336

HAWX: A Hardware-Aware FrameWork for Fast and Scalable ApproXimation of DNNs

Samira Nazari, Mohammad Saeed Almasi, Mahdi Taheri, Ali Azarpeyvand, Ali Mokhtari, Ali Mahani, Christian Herglotz

AI Summary

The paper introduces HAWX, a hardware-aware framework for efficient exploration of approximate Deep Neural Network (DNN) configurations. It uses multi-level sensitivity scoring across DNN abstraction levels (operator, filter, layer, and model) to decide where heterogeneous approximate computing (AxC) blocks can be integrated, and it employs predictive models for accuracy, power, and area to accelerate the evaluation of candidate configurations. The reported results are a speedup of over 23× in a layer-level search with two candidate approximate blocks and of more than (3×10^6)× in a filter-level search on LeNet-5, with accuracy comparable to exhaustive search. On benchmarks such as VGG-11, ResNet-18, and EfficientNetLite, the efficiency benefits scale exponentially with network size.
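The sensitivity-guided selection summarized above can be illustrated with a minimal Python sketch. It is not the HAWX algorithm: the layer names, sensitivity scores, block error and power figures, the linear accuracy surrogate, and the greedy visiting order are all hypothetical assumptions chosen for illustration. The sketch only shows the general idea of using per-layer sensitivity and a cheap predictor, rather than full inference runs, to decide where an approximate block is tolerable.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AxBlock:
    """A candidate computing block (all figures are hypothetical)."""
    name: str
    error: float         # assumed relative error introduced by the block
    power_saving: float  # assumed fraction of layer power saved


EXACT = AxBlock("exact", 0.0, 0.0)


def predicted_accuracy(baseline, sensitivities, assignment):
    """Toy surrogate: accuracy drop = sum of (layer sensitivity * block error)."""
    drop = sum(s * assignment[layer].error for layer, s in sensitivities.items())
    return baseline - drop


def greedy_assign(baseline, sensitivities, blocks, min_accuracy):
    """Visit layers from least to most sensitive; for each layer, keep the
    block with the largest power saving whose predicted accuracy still
    meets the threshold, otherwise leave the layer exact."""
    assignment = {layer: EXACT for layer in sensitivities}
    for layer in sorted(sensitivities, key=sensitivities.get):
        for block in sorted(blocks, key=lambda b: b.power_saving, reverse=True):
            trial = {**assignment, layer: block}
            if predicted_accuracy(baseline, sensitivities, trial) >= min_accuracy:
                assignment = trial
                break
    return assignment


# Hypothetical inputs: four layers and two candidate approximate blocks.
sensitivities = {"conv1": 0.9, "conv2": 0.4, "fc1": 0.1, "fc2": 0.05}
blocks = [AxBlock("ax_mild", 0.01, 0.10), AxBlock("ax_aggressive", 0.05, 0.30)]

result = greedy_assign(0.99, sensitivities, blocks, min_accuracy=0.97)
for layer, block in result.items():
    print(f"{layer}: {block.name}")
```

With these made-up numbers, the least sensitive layers receive the more aggressive block and the most sensitive layer stays exact, which is the qualitative behavior that sensitivity scoring is meant to produce.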

Key Contribution

HAWX replaces exhaustive search with multi-level sensitivity analysis that guides the integration of heterogeneous approximate computing blocks, achieving a speedup of more than 3×10^6 in filter-level search on LeNet-5.

Abstract

This work presents HAWX, a hardware-aware scalable exploration framework that employs multi-level sensitivity scoring at different DNN abstraction levels (operator, filter, layer, and model) to guide selective integration of heterogeneous approximate computing (AxC) blocks. Supported by predictive models for accuracy, power, and area, HAWX accelerates the evaluation of candidate configurations, achieving over 23× speedup in a layer-level search with two candidate approximate blocks and more than (3×10^6)× speedup in a filter-level search for LeNet-5 alone, while maintaining accuracy comparable to exhaustive search. Experiments across state-of-the-art DNN benchmarks such as VGG-11, ResNet-18, and EfficientNetLite demonstrate that the efficiency benefits of HAWX scale exponentially with network size. The HAWX hardware-aware search algorithm supports both spatial and temporal accelerator architectures, leveraging either off-the-shelf approximate components or customized designs.
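To see why the benefit over exhaustive search can grow exponentially with network size, consider the following back-of-the-envelope Python sketch. It assumes, purely for illustration, that each of `num_units` units (layers or filters) independently picks one of `choices_per_unit` options, and it compares that against a hypothetical guided search that evaluates each unit-choice pair once. Neither cost model is taken from the paper, and the numbers printed do not reproduce the reported 23× or 3×10^6 figures.

```python
def exhaustive_configs(num_units: int, choices_per_unit: int) -> int:
    """Configurations when every unit independently picks one option."""
    return choices_per_unit ** num_units


def guided_evaluations(num_units: int, choices_per_unit: int) -> int:
    """Assumed cost of a guided search: one evaluation per unit-choice pair."""
    return num_units * choices_per_unit


def speedup(num_units: int, choices_per_unit: int) -> float:
    """Ratio of exhaustive to guided evaluation counts."""
    return (exhaustive_configs(num_units, choices_per_unit)
            / guided_evaluations(num_units, choices_per_unit))


# Three options per unit, e.g. exact plus two approximate blocks (assumption).
for n in (5, 10, 20):
    print(n, exhaustive_configs(n, 3), guided_evaluations(n, 3), speedup(n, 3))
```

The exhaustive count grows as a power of the number of units while the assumed guided count grows linearly, so the ratio between them widens rapidly as layers or filters are added.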

Architecture Design (Transformers, SSMs, MoE), Distributed Systems & Hardware, Inference & Quantization


Year: 2026
