Search papers, labs, and topics across Lattice.
This paper formalizes and quantifies the conditioning consistency gap in Conditional Neural Processes (CNPs), which measures the discrepancy between adding a data point to the context set versus conditioning on it. They prove that for CNPs with bounded encoders and Lipschitz decoders, the consistency gap diminishes at a rate of $O(1/n^2)$ with respect to the context size $n$, and that this rate is tight. This provides a theoretical justification for why CNPs, despite violating Kolmogorov consistency, still perform well in practice, especially with larger context sets.
CNPs are provably inconsistent with true stochastic processes, but this work shows *how much* they deviate, with a tight $O(1/n^2)$ bound on the conditioning consistency gap.
Neural processes are meta-learning models that map context sets to predictive distributions. While inspired by stochastic processes, NPs do not generally satisfy the Kolmogorov consistency conditions required to define a valid stochastic process. This inconsistency is widely acknowledged but poorly understood. Practitioners note that NPs work well despite the violation, without quantifying what this means. We address this gap by defining the conditioning consistency gap, a KL divergence measuring how much a conditional neural process's (CNP) predictions change when a point is added to the context versus conditioned upon. Our main results show that for CNPs with bounded encoders and Lipschitz decoders, the consistency gap is $O(1/n^2)$ in context size $n$, and that this rate is tight. These bounds establish the precise sense in which CNPs approximate valid stochastic processes. The inconsistency is negligible for moderate context sizes but can be significant in the few-shot regime.