The paper introduces a GAN architecture design based on deep unfolding of Langevin MCMC algorithms for efficient high-dimensional posterior sampling. The approach maps a fixed number of MCMC iterations onto modular neural networks, which makes the sampler flexible, interpretable, and robust to changes in the likelihood parameters. In Bayesian imaging experiments, the method is reported to deliver high sampling accuracy and computational efficiency while preserving physics consistency, relative to traditional MCMC and other GAN-based sampling methods.
By unfolding Langevin MCMC kernels into GAN architectures, this work combines the speed of generative models with the interpretability of Bayesian methods for posterior sampling.
Markov chain Monte Carlo (MCMC) methods are fundamental to Bayesian computation, but can be computationally intensive, especially in high-dimensional settings. Push-forward generative models, such as generative adversarial networks (GANs), variational auto-encoders and normalising flows, offer a computationally efficient alternative for posterior sampling. However, push-forward models are opaque as they lack the modularity of Bayes' theorem, leading to poor generalisation with respect to changes in the likelihood function. In this work, we introduce a novel approach to GAN architecture design by applying deep unfolding to Langevin MCMC algorithms. This paradigm maps fixed-step iterative algorithms onto modular neural networks, yielding architectures that are both flexible and amenable to interpretation. Crucially, our design allows key model parameters to be specified at inference time, offering robustness to changes in the likelihood parameters. We train these unfolded samplers end-to-end using a supervised regularised Wasserstein GAN framework for posterior sampling. Through extensive Bayesian imaging experiments, we demonstrate that our proposed approach achieves high sampling accuracy and excellent computational efficiency, while retaining the physics consistency, adaptability and interpretability of classical MCMC strategies.
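To make the unfolding idea concrete, the following is a minimal sketch rather than the authors' architecture. It runs a fixed number of unadjusted Langevin steps for a linear Gaussian observation model y = Ax + noise, treating each step as one "layer". The function names, the Gaussian prior score, and the initialisation are illustrative assumptions; in the paper the learnable components are trained adversarially, which this sketch omits. It does show how the forward operator `A` and noise level `sigma` enter only through the likelihood gradient, so they can be changed at inference time.

```python
import numpy as np

def gaussian_prior_score(x, tau=1.0):
    """Score of a N(0, tau^2 I) prior. Hypothetical stand-in for a learned prior module."""
    return -x / tau**2

def unfolded_langevin(y, A, sigma, prior_score, step_sizes, rng):
    """Draw one approximate posterior sample with len(step_sizes) Langevin steps.

    Assumed model: y = A x + N(0, sigma^2 I). Each loop iteration corresponds
    to one unfolded layer; step_sizes would be learnable in a trained sampler.
    """
    x = A.T @ y  # simple data-driven initialisation (an assumption)
    for delta in step_sizes:
        # Gradient of the Gaussian log-likelihood: depends on A and sigma only.
        grad_lik = A.T @ (y - A @ x) / sigma**2
        drift = grad_lik + prior_score(x)
        noise = rng.standard_normal(x.shape)
        x = x + delta * drift + np.sqrt(2.0 * delta) * noise
    return x
```

Calling `unfolded_langevin` with a different `sigma` or `A` reuses the same prior module and step sizes, which is the modularity that a plain push-forward generator lacks.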