Mar 2, 2026arXiv:2603.01800

Phase-Type Variational Autoencoders for Heavy-Tailed Data

Abdelhakim Ziani, András Horváth, Paolo Ballarini

AI Summary

The paper introduces Phase-Type Variational Autoencoders (PH-VAEs) to address the limitations of standard VAEs in modeling heavy-tailed data. PH-VAEs use a latent-conditioned Phase-Type (PH) distribution as the decoder, enabling the model to adapt its tail behavior directly from the data by composing multiple exponential time scales. Experiments on synthetic and real-world data demonstrate that PH-VAE outperforms Gaussian, Student-t, and extreme-value-based VAE decoders in capturing tail behavior and extreme quantiles, while also capturing realistic cross-dimensional tail dependence in multivariate settings.

Key Contribution

Ditch the Gaussian decoder: Phase-Type VAEs learn heavy-tailed distributions directly from data, outperforming alternatives in modeling extreme events.

Abstract

Heavy-tailed distributions are ubiquitous in real-world data, where rare but extreme events dominate risk and variability. However, standard Variational Autoencoders (VAEs) employ simple decoder distributions (e.g., Gaussian) that fail to capture heavy-tailed behavior, while existing heavy-tail-aware extensions remain restricted to predefined parametric families whose tail behavior is fixed a priori. We propose the Phase-Type Variational Autoencoder (PH-VAE), whose decoder distribution is a latent-conditioned Phase-Type (PH) distribution defined as the absorption time of a continuous-time Markov chain (CTMC). This formulation composes multiple exponential time scales, yielding a flexible and analytically tractable decoder that adapts its tail behavior directly from the observed data. Experiments on synthetic and real-world benchmarks demonstrate that PH-VAE accurately recovers diverse heavy-tailed distributions, significantly outperforming Gaussian, Student-t, and extreme-value-based VAE decoders in modeling tail behavior and extreme quantiles. In multivariate settings, PH-VAE captures realistic cross-dimensional tail dependence through its shared latent representation. To our knowledge, this is the first work to integrate Phase-Type distributions into deep generative modeling, bridging applied probability and representation learning.

Architecture Design (Transformers, SSMs, MoE)Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Phase-Type Variational Autoencoders for Heavy-Tailed Data

Related Papers