Department of MathematicsMay 6, 2026arXiv:2605.05093

Proximal Projection for Doubly Sparse Regularized Models

Jia Wei He, R. Ayesha Ali, Gerarda Darlington

AI Summary

This paper introduces a doubly sparse regularized regression model that leverages the underlying Gaussian graphical model structure of predictors by decomposing the coefficient vector into latent node contributions. A novel proximal projection method is proposed to optimize the model with a penalty function that balances L1 and L2 regularization. The method demonstrates stable performance compared to other sparse graphical regression models, particularly in high-dimensional settings, by efficiently computing the projection operator for intersecting groups.

Key Contribution

Doubly sparse regression gets a boost: this method avoids predictor duplication, saving compute, by projecting directly onto the intersection of selected groups.

Abstract

Regularization is often used in high-dimensional regression settings to generate a sparse model, which can save tremendous computing resources and identify predictors that are most strongly associated with the response. When the predictors can be represented by a Gaussian graphical model, the structure of the predictor graph can be exploited during regularization. Our proposed model exploits this underlying predictor graph structure by decomposing the estimated coefficient vector into a sum of latent variables that correspond to the sum of each node contribution to the coefficient vector. Regularization is then performed on the latent variables rather than on the coefficient vector directly. We use a penalty function that permits a clear user-defined trade-off between the L1 and L2 penalties and propose a novel proximal projection during optimization. Further, our implementation computes the projection operator for the intersection of selected groups, which conserves more computing resources compared to predictor duplication methods, especially for high-dimensional data. Through simulation, we evaluate the performance of our approach under different graph structures and node counts, and present results on real-world data. Results suggest that our method exhibits stable performance relative to other singly or doubly sparse graphical regression models.

Architecture Design (Transformers, SSMs, MoE)Training Efficiency & Optimization

Citation Metrics

Citations0

Influential citations0

References0

Year2026

VenueN/A

Related Papers

Finding related papers...

Search

Proximal Projection for Doubly Sparse Regularized Models

Related Papers