ColumbiaApr 21, 2026arXiv:2604.19444

Unsupervised Confidence Calibration for Reasoning LLMs from a Single Generation

AI Summary

This paper introduces an unsupervised confidence calibration method for reasoning LLMs that requires only a single generation at inference time. The approach uses offline sampling on unlabeled data to create a self-consistency-based proxy target, which is then distilled into a lightweight confidence predictor. Experiments across 5 tasks and 9 models demonstrate significant improvements over baselines, even under distribution shift, enhancing performance in selective prediction and simulated decision-making.

Key Contribution

Reasoning LLMs can now produce well-calibrated confidence estimates without labels or repeated sampling, unlocking more reliable real-world deployment.

Abstract

Reasoning language models can solve increasingly complex tasks, but struggle to produce the calibrated confidence estimates necessary for reliable deployment. Existing calibration methods usually depend on labels or repeated sampling at inference time, making them impractical in many settings. We introduce a method for unsupervised confidence calibration of reasoning LLMs when only a single generation is available at inference time. Our approach uses offline sampling on unlabeled data to derive a self-consistency-based proxy target, then distills this signal into a lightweight deployment-time confidence predictor. In a broad evaluation across 5 math and question-answering tasks using 9 reasoning models, our method substantially outperforms baselines, including under distribution shift, and improves downstream performance in selective prediction and simulated downstream decision-making.

Eval Frameworks & Benchmarks Natural Language Processing Reasoning & Chain-of-Thought

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Unsupervised Confidence Calibration for Reasoning LLMs from a Single Generation

Related Papers