Search papers, labs, and topics across Lattice.
This paper introduces unbiased randomized approximations of vector-Jacobian products (VJPs) to reduce the computational and memory costs of backpropagation in deep neural networks. The authors provide a theoretical analysis quantifying the trade-off between per-epoch cost reduction and the number of epochs required to reach a target precision. They identify specific unbiased VJP estimators with minimal variance under sparsity constraints and empirically validate their approach on MLPs, BagNets, and Visual Transformers.
Cut deep learning costs without sacrificing accuracy by swapping exact backpropagation for unbiased, randomized approximations of vector-Jacobian products.
In this work we introduce methods to reduce the computational and memory costs of training deep neural networks. Our approach consists in replacing exact vector-jacobian products by randomized, unbiased approximations thereof during backpropagation. We provide a theoretical analysis of the trade-off between the number of epochs needed to achieve a target precision and the cost reduction for each epoch. We then identify specific unbiased estimates of vector-jacobian products for which we establish desirable optimality properties of minimal variance under sparsity constraints. Finally we provide in-depth experiments on multi-layer perceptrons, BagNets and Visual Transfomers architectures. These validate our theoretical results, and confirm the potential of our proposed unbiased randomized backpropagation approach for reducing the cost of deep learning.