Search papers, labs, and topics across Lattice.
The paper introduces Gaussian Approximation of CTGAN (GACTGAN), a novel Bayesian GAN approach for tabular data synthesis that integrates Stochastic Weight Averaging-Gaussian (SWAG) into the CTGAN generator to approximate the Bayesian posterior. This method addresses the limitations of standard CTGANs in navigating the risk-utility trade-off and avoids the computational expense of MCMC-based Bayesian GANs. Experiments demonstrate that GACTGAN generates synthetic tabular data with improved preservation of structure and inferential statistics, while also reducing privacy risk compared to CTGAN.
GACTGAN offers a computationally cheaper and more effective Bayesian GAN for tabular data synthesis, outperforming CTGAN in utility and privacy.
Generative Adversarial Networks (GAN) have been used in many studies to synthesise mixed tabular data. Conditional tabular GAN (CTGAN) have been the most popular variant but struggle to effectively navigate the risk-utility trade-off. Bayesian GAN have received less attention for tabular data, but have been explored with unstructured data such as images and text. The most used technique employed in Bayesian GAN is Markov Chain Monte Carlo (MCMC), but it is computationally intensive, particularly in terms of weight storage. In this paper, we introduce Gaussian Approximation of CTGAN (GACTGAN), an integration of the Bayesian posterior approximation technique using Stochastic Weight Averaging-Gaussian (SWAG) within the CTGAN generator to synthesise tabular data, reducing computational overhead after the training phase. We demonstrate that GACTGAN yields better synthetic data compared to CTGAN, achieving better preservation of tabular structure and inferential statistics with less privacy risk. These results highlight GACTGAN as a simpler, effective implementation of Bayesian tabular synthesis.