Feb 16, 2026

arXiv:2602.14934

Activation-Space Uncertainty Quantification for Pretrained Networks

Richard Bergna, Stefan Depeweg, Sergio Calvo-Ordoñez, Jonathan Plenk, Alvaro Cartea, Jose Miguel Hernández-Lobato

AI Summary

The paper introduces Gaussian Process Activations (GAPA), a post-hoc uncertainty quantification method that replaces standard nonlinearities in pretrained networks with Gaussian process activations. GAPA preserves the original model's point predictions by ensuring the posterior mean of the GP activations matches the original activations, while also providing closed-form epistemic variances in activation space. By using a sparse variational inducing-point approximation and k-NN subset conditioning, GAPA scales to large architectures and achieves competitive calibration and OOD detection performance compared to existing post-hoc methods, without requiring retraining or sampling.
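
One way a posterior mean can match the original activation by construction is sketched below. This is an illustration under assumptions the summary does not state: the GP prior mean is taken to be the original activation \phi, the targets are the cached activations y_i = \phi(z_i) at cached pre-activations z_i, k is a generic kernel, and \sigma^2 is a small noise term. With k_* = [k(z, z_i)]_i and K_{ij} = k(z_i, z_j), standard GP conditioning gives a zero mean correction (the residual y - \phi(Z) vanishes) and leaves only a closed-form variance:

```latex
\begin{aligned}
f &\sim \mathcal{GP}\big(\phi(\cdot),\, k(\cdot,\cdot)\big), \qquad y_i = \phi(z_i), \quad i = 1,\dots,n, \\
\mu(z) &= \phi(z) + k_*^\top (K + \sigma^2 I)^{-1}\big(y - \phi(Z)\big) = \phi(z), \\
\sigma_f^2(z) &= k(z,z) - k_*^\top (K + \sigma^2 I)^{-1} k_* .
\end{aligned}
```

The variance depends only on where the cached inputs lie relative to z, so it grows as z moves away from the training activations. The paper's actual construction uses a sparse variational inducing-point approximation and may differ in detail.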

Key Contribution

Get calibrated uncertainty from your frozen pretrained models without retraining or Monte Carlo sampling: just swap in Gaussian Process Activations (GAPA).

Abstract

Reliable uncertainty estimates are crucial for deploying pretrained models; yet, many strong methods for quantifying uncertainty require retraining, Monte Carlo sampling, or expensive second-order computations and may alter a frozen backbone's predictions. To address this, we introduce Gaussian Process Activations (GAPA), a post-hoc method that shifts Bayesian modeling from weights to activations. GAPA replaces standard nonlinearities with Gaussian-process activations whose posterior mean exactly matches the original activation, preserving the backbone's point predictions by construction while providing closed-form epistemic variances in activation space. To scale to modern architectures, we use a sparse variational inducing-point approximation over cached training activations, combined with local k-nearest-neighbor subset conditioning, enabling deterministic single-pass uncertainty propagation without sampling, backpropagation, or second-order information. Across regression, classification, image segmentation, and language modeling, GAPA matches or outperforms strong post-hoc baselines in calibration and out-of-distribution detection while remaining efficient at test time.
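
The idea of a mean-preserving GP activation with local k-nearest-neighbor conditioning can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: it handles a single scalar pre-activation, uses exact GP conditioning on the k nearest cached points in place of the sparse variational inducing-point approximation, and assumes an RBF kernel with a prior mean equal to the original activation. The names `rbf`, `solve`, and `gp_activation`, and all hyperparameter values, are hypothetical.

```python
import math

def rbf(a, b, lengthscale=1.0, signal_var=1.0):
    """Squared-exponential kernel between two scalar pre-activations (assumed kernel)."""
    return signal_var * math.exp(-0.5 * ((a - b) / lengthscale) ** 2)

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def gp_activation(z, cached, phi=math.tanh, k=5, noise=1e-6):
    """Return (mean, variance) of a GP activation at pre-activation z.

    cached: pre-activations recorded on training data (assumed distinct).
    """
    # Local conditioning: keep only the k cached pre-activations nearest to z.
    nbrs = sorted(cached, key=lambda c: abs(c - z))[:k]
    K = [[rbf(a, b) + (noise if i == j else 0.0) for j, b in enumerate(nbrs)]
         for i, a in enumerate(nbrs)]
    k_star = [rbf(z, a) for a in nbrs]
    alpha = solve(K, k_star)  # alpha = (K + noise * I)^{-1} k_star
    # Targets are phi(z_i) and the prior mean is phi, so every residual is zero
    # and the posterior mean reduces to the original activation.
    resid = [phi(a) - phi(a) for a in nbrs]
    mean = phi(z) + sum(w * r for w, r in zip(alpha, resid))
    # Closed-form posterior variance, clipped at zero against round-off.
    var = max(rbf(z, z) - sum(w * ks for w, ks in zip(alpha, k_star)), 0.0)
    return mean, var

cached = [-2.0, -1.0, -0.5, 0.0, 0.5, 1.0, 2.0]
mean_in, var_in = gp_activation(0.25, cached)    # inside the cached range
mean_out, var_out = gp_activation(10.0, cached)  # far from any cached point
```

In this sketch the mean equals `math.tanh(z)` for every input, so point predictions are unchanged, while the variance is small near cached pre-activations and approaches the prior variance far from them. No sampling or gradient computation is involved.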

Inference & Quantization

Training Efficiency & Optimization

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
