Search papers, labs, and topics across Lattice.
This paper introduces a unified Majorize-Minimisation (MM) framework for both traditional and convex Non-negative Matrix Factorization (NMF) under Tweedie and Negative Binomial distributional assumptions, addressing limitations of standard Gaussian or Poisson noise models. The authors derive multiplicative update rules for all models, including novel updates for convex NMF with Poisson and Negative Binomial cost functions. Empirical evaluations on mutational and word count data show that the choice of noise model significantly impacts model fit and feature recovery, with convex NMF offering a robust alternative when the number of classes is large.
Ditching Gaussian and Poisson noise assumptions in NMF can dramatically improve model fit and feature recovery, especially when using Tweedie and Negative Binomial distributions for overdispersed data.
Non-negative matrix factorisation (NMF) is a widely used tool for unsupervised learning and feature extraction, with applications ranging from genomics to text analysis and signal processing. Standard formulations of NMF are typically derived under Gaussian or Poisson noise assumptions, which may be inadequate for data exhibiting overdispersion or other complex mean-variance relationships. In this paper, we develop a unified framework for both traditional and convex NMF under a broad class of distributional assumptions, including Negative Binomial and Tweedie models, where the connection between the Tweedie and the $β$-divergence is also highlighted. Using a Majorize-Minimisation approach, we derive multiplicative update rules for all considered models, and novel updates for convex NMF with Poisson and Negative Binomial cost functions. We provide a unified implementation of all considered models, including the first implementations of several convex NMF models. Empirical evaluations on mutational and word count data demonstrate that the choice of noise model critically affects model fit and feature recovery, and that convex NMF can provide an efficient and robust alternative to traditional NMF in scenarios where the number of classes is large. The code for our proposed updates is available in the R package nmfgenr and can be found at https://github.com/MartaPelizzola/nmfgenr.