Apr 16, 2026 | arXiv:2604.15141

KVNN: Learnable Multi-Kernel Volterra Neural Networks

Haoyu Yun, Hamid Krim, Yufang Bao

AI Summary

This paper introduces a kernelized Volterra Neural Network (kVNN) that uses a learnable multi-kernel representation, modeling each interaction order with its own polynomial-kernel component. An order-adaptive parameterization with compact, learnable centers keeps higher-order learning efficient. Experiments on video action recognition and image denoising show that kVNN reduces parameter count and computational cost (GFLOPs) while maintaining or improving performance, even when trained from scratch without large-scale pretraining.

Key Contribution

kVNN achieves comparable or better performance with reduced model size and computational cost by replacing standard convolutional kernels with structured, kernelized higher-order layers.

Abstract

Higher-order learning is fundamentally rooted in exploiting compositional features. It hinges on enriching the representation through more elaborate interactions of the data, which in turn tends to increase the model complexity of conventional large-scale deep learning models. In this paper, a kernelized Volterra Neural Network (kVNN) is proposed. The key to the achieved efficiency lies in using a learnable multi-kernel representation, where different interaction orders are modeled by distinct polynomial-kernel components with compact, learnable centers, yielding an order-adaptive parameterization. Features are learned by the composition of layers, each of which consists of parallel branches of different polynomial orders, enabling kVNN filters to directly replace standard convolutional kernels within existing architectures. The theoretical results are substantiated by experiments on two representative tasks: video action recognition and image denoising. The results demonstrate favorable performance-efficiency trade-offs: kVNN consistently reduces model complexity (parameter count) and computational complexity (GFLOPs) while delivering competitive and often improved performance. These results are maintained even when the model is trained from scratch without large-scale pretraining. In summary, we substantiate that structured kernelized higher-order layers offer a practical path to balancing expressivity and computational cost in modern deep networks.
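To make the idea of parallel polynomial-order branches with learnable centers concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the class name `MultiKernelLayer`, the inhomogeneous kernel form `(<x, c> + bias) ** order`, the random initialization, and the per-order center counts are all assumptions chosen for illustration. Each branch handles one interaction order and sums weighted kernel responses between the input patch and that branch's centers.

```python
import random


def poly_kernel(x, c, order, bias=1.0):
    """Polynomial kernel k(x, c) = (<x, c> + bias) ** order (assumed form)."""
    dot = sum(xi * ci for xi, ci in zip(x, c))
    return (dot + bias) ** order


class MultiKernelLayer:
    """Hypothetical layer: one branch per interaction order, each branch
    holding its own small set of centers and one weight per center."""

    def __init__(self, dim, centers_per_order, seed=0):
        rng = random.Random(seed)
        self.dim = dim
        self.branches = {}
        for order, n_centers in centers_per_order.items():
            centers = [[rng.uniform(-0.5, 0.5) for _ in range(dim)]
                       for _ in range(n_centers)]
            weights = [rng.uniform(-0.5, 0.5) for _ in range(n_centers)]
            self.branches[order] = (centers, weights)

    def __call__(self, x):
        # Sum the outputs of all parallel branches.
        out = 0.0
        for order, (centers, weights) in self.branches.items():
            out += sum(w * poly_kernel(x, c, order)
                       for c, w in zip(centers, weights))
        return out

    def num_params(self):
        # Each center contributes `dim` coordinates plus one weight.
        return sum(len(weights) * (self.dim + 1)
                   for _, weights in self.branches.values())


# Usage: a flattened 3x3 patch, with a different number of centers per order.
layer = MultiKernelLayer(dim=9, centers_per_order={1: 1, 2: 2, 3: 2})
patch = [0.1 * i for i in range(9)]
response = layer(patch)
param_count = layer.num_params()  # 5 centers * (9 + 1) = 50
```

In this sketch the parameter count grows with the number of centers rather than with the input dimension raised to the interaction order. For comparison, a full, unsymmetrized third-order coefficient tensor over the same 9-dimensional patch would hold 9^3 = 729 entries. Whether the paper's parameterization matches this accounting is not stated in the abstract.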

Architecture Design (Transformers, SSMs, MoE) | Training Efficiency & Optimization

Citation Metrics

Citations: 0
Influential citations: 0
References: 37
Year: 2026
Venue: N/A
