The paper introduces HAWX, a hardware-aware framework for efficient exploration of approximate Deep Neural Network (DNN) configurations by using multi-level sensitivity scoring across different DNN abstraction levels (operator, filter, layer, and model). HAWX employs predictive models for accuracy, power, and area to accelerate the evaluation of candidate configurations with heterogeneous approximate computing (AxC) blocks. Results show significant speedups in layer-level and filter-level searches while maintaining accuracy comparable to exhaustive search, and the efficiency benefits scale exponentially with network size on benchmarks like VGG-11, ResNet-18, and EfficientNetLite.
Forget exhaustive search: HAWX reports a speedup of more than 3 million times when searching for approximate DNN configurations, using multi-level sensitivity analysis to guide the integration of heterogeneous approximate computing blocks.
This work presents HAWX, a hardware-aware scalable exploration framework that employs multi-level sensitivity scoring at different DNN abstraction levels (operator, filter, layer, and model) to guide selective integration of heterogeneous AxC blocks. Supported by predictive models for accuracy, power, and area, HAWX accelerates the evaluation of candidate configurations, achieving over 23× speedup in a layer-level search with two candidate approximate blocks and more than (3×10^6)× speedup in a filter-level search for LeNet-5 alone, while maintaining accuracy comparable to exhaustive search. Experiments across state-of-the-art DNN benchmarks such as VGG-11, ResNet-18, and EfficientNetLite demonstrate that the efficiency benefits of HAWX scale exponentially with network size. The HAWX hardware-aware search algorithm supports both spatial and temporal accelerator architectures, leveraging either off-the-shelf approximate components or customized designs.
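To illustrate why exhaustive search becomes impractical and how sensitivity scores can prune it, the sketch below counts the configurations an exhaustive search would visit and contrasts it with a simple sensitivity-ranked greedy pass. This is not the HAWX algorithm: the `Block` type, the `greedy_assign` heuristic, the penalty model (sensitivity times block error), and all numeric values are hypothetical assumptions chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Block:
    """Hypothetical compute block; values are illustrative, not measured."""
    name: str
    error: float  # assumed relative error introduced by the block (exact = 0.0)
    power: float  # assumed relative power cost (exact = 1.0)


def exhaustive_configs(num_units: int, num_choices: int) -> int:
    """Configurations to evaluate when each unit (layer or filter)
    independently picks one of num_choices blocks."""
    return num_choices ** num_units


def greedy_assign(sensitivity, blocks, budget):
    """Assign one block per unit with a single pass over the units.

    Assumed penalty model: placing block b on unit i costs
    sensitivity[i] * b.error of a shared accuracy budget.
    The block list must include an exact block with error 0.0,
    which always fits and serves as the fallback.
    """
    exact = min(blocks, key=lambda b: b.error)
    assignment = [exact] * len(sensitivity)
    remaining = budget
    by_power = sorted(blocks, key=lambda b: b.power)
    # Visit the least sensitive units first: they tolerate approximation best.
    for i in sorted(range(len(sensitivity)), key=lambda i: sensitivity[i]):
        # Try the lowest-power block first, falling back toward the exact one.
        for b in by_power:
            cost = sensitivity[i] * b.error
            if cost <= remaining:
                assignment[i] = b
                remaining -= cost
                break
    return assignment


blocks = [
    Block("exact", 0.0, 1.0),
    Block("ax1", 0.01, 0.7),
    Block("ax2", 0.05, 0.4),
]
sensitivity = [0.9, 0.1, 0.5]  # hypothetical per-layer scores
chosen = greedy_assign(sensitivity, blocks, budget=0.02)
names = [b.name for b in chosen]
```

The exhaustive count grows as `num_choices ** num_units`, so moving from layers to filters multiplies the exponent, whereas the greedy pass examines at most `len(sensitivity) * len(blocks)` candidate placements. A real framework would replace the assumed penalty model with learned accuracy, power, and area predictors.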