Search papers, labs, and topics across Lattice.
The paper introduces Phase-Type Variational Autoencoders (PH-VAEs) to address the limitations of standard VAEs in modeling heavy-tailed data. PH-VAEs use a latent-conditioned Phase-Type (PH) distribution as the decoder, enabling the model to adapt its tail behavior directly from the data by composing multiple exponential time scales. Experiments on synthetic and real-world data demonstrate that PH-VAE outperforms Gaussian, Student-t, and extreme-value-based VAE decoders in capturing tail behavior and extreme quantiles, while also capturing realistic cross-dimensional tail dependence in multivariate settings.
Ditch the Gaussian decoder: Phase-Type VAEs learn heavy-tailed distributions directly from data, outperforming alternatives in modeling extreme events.
Heavy-tailed distributions are ubiquitous in real-world data, where rare but extreme events dominate risk and variability. However, standard Variational Autoencoders (VAEs) employ simple decoder distributions (e.g., Gaussian) that fail to capture heavy-tailed behavior, while existing heavy-tail-aware extensions remain restricted to predefined parametric families whose tail behavior is fixed a priori. We propose the Phase-Type Variational Autoencoder (PH-VAE), whose decoder distribution is a latent-conditioned Phase-Type (PH) distribution defined as the absorption time of a continuous-time Markov chain (CTMC). This formulation composes multiple exponential time scales, yielding a flexible and analytically tractable decoder that adapts its tail behavior directly from the observed data. Experiments on synthetic and real-world benchmarks demonstrate that PH-VAE accurately recovers diverse heavy-tailed distributions, significantly outperforming Gaussian, Student-t, and extreme-value-based VAE decoders in modeling tail behavior and extreme quantiles. In multivariate settings, PH-VAE captures realistic cross-dimensional tail dependence through its shared latent representation. To our knowledge, this is the first work to integrate Phase-Type distributions into deep generative modeling, bridging applied probability and representation learning.