Search papers, labs, and topics across Lattice.
The paper introduces VIKIN, a reconfigurable hardware accelerator designed to efficiently support both Kolmogorov-Arnold Networks (KANs) and Multi-Layer Perceptrons (MLPs). VIKIN employs a pipeline execution mode and two-stage sparsity support optimized for KANs, while also enabling parallel-mode acceleration for MLPs within the same sparsity framework, addressing the lack of dedicated hardware for KANs and the continued relevance of MLPs. Experimental results on real-world datasets demonstrate that VIKIN achieves significant speedups and energy efficiency improvements compared to edge GPUs when executing KAN workloads, with minimal accuracy loss when replacing MLPs with KANs.
KANs can run 1.25x faster and 4.87x more energy-efficiently than edge GPUs, thanks to a new reconfigurable accelerator that handles both KANs and MLPs.
Recently, multi-layer perceptrons (MLPs) widely used in modern AI applications suffer from limited real-time performance due to intensive memory access overhead. Kolmogorov--Arnold Networks (KANs) have attracted increasing attention as an alternative architecture with similar structures to MLPs but improved parameter efficiency. However, the lack of dedicated hardware support limits the practical performance benefits of KANs. Moreover, since many edge workloads still rely heavily on MLPs, accelerators designed exclusively for KANs become inefficient and impractical. In this work, we present VIKIN, a reconfigurable accelerator that efficiently supports both KAN and MLP inference using unified hardware. VIKIN introduces a pipeline execution mode and two-stage sparsity support for efficient KAN processing, while enabling parallel-mode acceleration to improve MLP throughput under the same sparsity framework. Experiments on real-world datasets demonstrate that replacing MLPs with KANs on VIKIN achieves $1.28\times$ acceleration with $19.58\%$ reduced accuracy loss. For a higher-accuracy KAN model requiring $3.29\times$ more operations, VIKIN incurs only $1.24\times$ latency overhead compared with the baseline KAN model. In addition, VIKIN achieves $1.25\times$ speedup and $4.87\times$ higher energy efficiency than a representative edge GPU when executing KAN workloads.