MIT CSAILColumbiaRitualFeb 19, 2026arXiv:2602.17223

Privacy-Preserving Mechanisms Enable Cheap Verifiable Inference of LLMs

Arka Pal, Louai Zahran, William Gvozdjak, Akilesh Potti, Micah Goldblum

AI Summary

This paper introduces two novel protocols for verifiable inference of LLMs that leverage privacy-preserving inference techniques to provide guarantees about the computation performed. By exploiting the properties of privacy-preserving methods, the protocols achieve verification with minimal overhead, requiring only a few extra tokens of computation. The proposed approaches demonstrate improved verification runtime compared to existing zero-knowledge proof-based methods, highlighting a connection between privacy and verifiability in LLM inference.

Key Contribution

Verifiable LLM inference becomes practical: privacy-preserving techniques unlock verification at near-zero cost, outperforming ZKPs.

Abstract

As large language models (LLMs) continue to grow in size, fewer users are able to host and run models locally. This has led to increased use of third-party hosting services. However, in this setting, there is a lack of guarantees on the computation performed by the inference provider. For example, a dishonest provider may replace an expensive large model with a cheaper-to-run weaker model and return the results from the weaker model to the user. Existing tools to verify inference typically rely on methods from cryptography such as zero-knowledge proofs (ZKPs), but these add significant computational overhead, and remain infeasible for use for large models. In this work, we develop a new insight -- that given a method for performing private LLM inference, one can obtain forms of verified inference at marginal extra cost. Specifically, we propose two new protocols which leverage privacy-preserving LLM inference in order to provide guarantees over the inference that was carried out. Our approaches are cheap, requiring the addition of a few extra tokens of computation, and have little to no downstream impact. As the fastest privacy-preserving inference methods are typically faster than ZK methods, the proposed protocols also improve verification runtime. Our work provides novel insights into the connections between privacy and verifiability in LLM inference.

Inference & Quantization Open-Source Models & Weights

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Privacy-Preserving Mechanisms Enable Cheap Verifiable Inference of LLMs

Related Papers