Search papers, labs, and topics across Lattice.
This paper introduces an efficient method for credal prediction, representing uncertainty as intervals of plausible probabilities for each class label. The method, termed "decalibration," is inspired by calibration techniques for probabilistic classifiers and grounded in relative likelihood. Experiments demonstrate strong performance in coverage-efficiency, out-of-distribution detection, and in-context learning, enabling credal prediction for complex models like TabPFN and CLIP where it was previously infeasible.
Credal sets, previously impractical for large models, are now efficiently computable via a "decalibration" method that delivers strong performance in uncertainty-aware tasks.
A reliable representation of uncertainty is essential for the application of modern machine learning methods in safety-critical settings. In this regard, the use of credal sets (i.e., convex sets of probability distributions) has recently been proposed as a suitable approach to representing epistemic uncertainty. However, as with other approaches to epistemic uncertainty, training credal predictors is computationally complex and usually involves (re-)training an ensemble of models. The resulting computational complexity prevents their adoption for complex models such as foundation models and multi-modal systems. To address this problem, we propose an efficient method for credal prediction that is grounded in the notion of relative likelihood and inspired by techniques for the calibration of probabilistic classifiers. For each class label, our method predicts a range of plausible probabilities in the form of an interval. To produce the lower and upper bounds of these intervals, we propose a technique that we refer to as decalibration. Extensive experiments show that our method yields credal sets with strong performance across diverse tasks, including coverage-efficiency evaluation, out-of-distribution detection, and in-context learning. Notably, we demonstrate credal prediction on models such as TabPFN and CLIP -- architectures for which the construction of credal sets was previously infeasible.