McGill · Feb 24, 2026 · arXiv:2602.21160

Not Just How Much, But Where: Decomposing Epistemic Uncertainty into Per-Class Contributions

AI Summary

This paper addresses a limitation of summarising epistemic uncertainty with a single scalar, mutual information (MI), in safety-critical classification tasks where the cost of failure varies across classes. The authors decompose MI into a per-class vector, $C_k(x)$, derived from a second-order Taylor expansion of the entropy, so that uncertainty can be attributed to individual classes. They evaluate the decomposition on selective prediction, out-of-distribution detection, and a controlled label-noise study. They report gains over MI and variance baselines in selective prediction, the highest AUROC in out-of-distribution detection, and lower sensitivity to injected label noise than MI under end-to-end Bayesian training, although both metrics degrade under transfer learning.
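The Taylor-expansion step mentioned above can be sketched as follows. This is a reconstruction from the definitions given in the abstract ($μ_k=\mathbb{E}[p_k]$, $σ_k^2=\mathrm{Var}[p_k]$ across posterior samples), assuming natural logarithms and the standard form of MI as the entropy of the mean prediction minus the mean entropy; the helper function $h$ is introduced here for illustration, and the authors' own derivation may be organised differently.

```latex
\begin{align*}
h(p) &= -p\log p, \qquad h'(p) = -\log p - 1, \qquad h''(p) = -\frac{1}{p},\\[4pt]
\mathrm{MI} &= H\big(\mathbb{E}[p]\big) - \mathbb{E}\big[H(p)\big]
  = \sum_k \Big( h(\mu_k) - \mathbb{E}\big[h(p_k)\big] \Big),\\[4pt]
\mathbb{E}\big[h(p_k)\big] &\approx h(\mu_k)
  + h'(\mu_k)\,\underbrace{\mathbb{E}[p_k - \mu_k]}_{=\,0}
  + \tfrac{1}{2}\,h''(\mu_k)\,\mathbb{E}\big[(p_k - \mu_k)^2\big]
  = h(\mu_k) - \frac{\sigma_k^2}{2\mu_k},\\[4pt]
\mathrm{MI} &\approx \sum_k \frac{\sigma_k^2}{2\mu_k} = \sum_k C_k(x).
\end{align*}
```

The first-order term vanishes because $μ_k$ is the mean of $p_k$, and the $1/μ_k$ factor comes directly from the curvature of $-p\log p$. The terms dropped at third order and above depend on the skewness of $p_k$, which is consistent with the abstract's use of a skewness diagnostic to flag inputs where the approximation degrades.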

Key Contribution

Instead of a single uncertainty score, this method decomposes epistemic uncertainty to reveal *which* classes a model is ignorant about, which supports safer decisions when some classes are more critical than others.

Abstract

In safety-critical classification, the cost of failure is often asymmetric, yet Bayesian deep learning summarises epistemic uncertainty with a single scalar, mutual information (MI), that cannot distinguish whether a model's ignorance involves a benign or safety-critical class. We decompose MI into a per-class vector $C_k(x)=σ_k^{2}/(2μ_k)$, with $μ_k{=}\mathbb{E}[p_k]$ and $σ_k^2{=}\mathrm{Var}[p_k]$ across posterior samples. The decomposition follows from a second-order Taylor expansion of the entropy; the $1/μ_k$ weighting corrects boundary suppression and makes $C_k$ comparable across rare and common classes. By construction $\sum_k C_k \approx \mathrm{MI}$, and a companion skewness diagnostic flags inputs where the approximation degrades. After characterising the axiomatic properties of $C_k$, we validate it on three tasks: (i) selective prediction for diabetic retinopathy, where critical-class $C_k$ reduces selective risk by 34.7% over MI and 56.2% over variance baselines; (ii) out-of-distribution detection on clinical and image benchmarks, where $\sum_k C_k$ achieves the highest AUROC and the per-class view exposes asymmetric shifts invisible to MI; and (iii) a controlled label-noise study in which $\sum_k C_k$ shows less sensitivity to injected aleatoric noise than MI under end-to-end Bayesian training, while both metrics degrade under transfer learning. Across all tasks, the quality of the posterior approximation shapes uncertainty at least as strongly as the choice of metric, suggesting that how uncertainty is propagated through the network matters as much as how it is measured.
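The definition $C_k(x)=σ_k^{2}/(2μ_k)$ can be illustrated with a minimal NumPy sketch. It is not the authors' code: the function names, the `eps` guard, and the synthetic Dirichlet draws standing in for posterior samples are all assumptions made for illustration. It computes the per-class vector for one input and compares its sum with MI estimated from the same samples.

```python
import numpy as np


def per_class_epistemic(probs, eps=1e-12):
    """Per-class contributions C_k = sigma_k^2 / (2 * mu_k) for one input.

    probs: array of shape (S, K) holding S posterior samples of a
    K-class probability vector (hypothetical input layout).
    """
    probs = np.asarray(probs, dtype=float)
    mu = probs.mean(axis=0)   # mu_k = E[p_k] across posterior samples
    var = probs.var(axis=0)   # sigma_k^2 = Var[p_k] (population variance)
    # eps guards against division by zero for classes with mu_k == 0
    return var / (2.0 * np.maximum(mu, eps))


def entropy(p, eps=1e-12):
    """Shannon entropy in nats along the last axis."""
    p = np.clip(p, eps, 1.0)
    return -(p * np.log(p)).sum(axis=-1)


def mutual_information(probs):
    """MI = H(E[p]) - E[H(p)], estimated from the same samples."""
    probs = np.asarray(probs, dtype=float)
    return entropy(probs.mean(axis=0)) - entropy(probs).mean()


# Synthetic stand-in for posterior samples: Dirichlet draws around a
# mean of roughly (0.80, 0.16, 0.04). Real samples would come from
# e.g. MC dropout or an ensemble, which this sketch does not model.
rng = np.random.default_rng(0)
samples = rng.dirichlet(alpha=[40.0, 8.0, 2.0], size=2000)

c = per_class_epistemic(samples)
mi = mutual_information(samples)
print("C_k      :", c)
print("sum_k C_k:", c.sum())
print("MI       :", mi)
```

For a task with an asymmetric cost, one could rank or reject inputs using the entry of `c` at the index of the safety-critical class instead of `c.sum()`. Which index that is, and any rejection threshold, are application choices that this sketch does not prescribe.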

Constitutional AI & AI Ethics · Eval Frameworks & Benchmarks · Interpretability & Mechanistic Interp

Citation Metrics

Citations: 0

Influential citations: 0

References: 0

Year: 2026

Venue: N/A
