Search papers, labs, and topics across Lattice.
This paper introduces LSRTR-M, an accelerated algorithm for estimating Low Separation Rank Tensor Generalized Linear Models (LSR-TGLMs) that are used to model tensor-valued data in multidimensional signal and imaging problems. The key innovation is the replacement of computationally expensive QR-based projection steps in the original LSRTR algorithm with Muon (MomentUm Orthogonalized by Newton-Schulz) updates within a block coordinate descent scheme. Experiments on synthetic and real-world datasets demonstrate that LSRTR-M achieves faster convergence, lower estimation/prediction errors, and improved computational efficiency compared to LSRTR.
Ditch slow QR projections: Muon-accelerated updates slash computation time for tensor GLMs without sacrificing accuracy.
Tensor-valued data arise naturally in multidimensional signal and imaging problems, such as biomedical imaging. When incorporated into generalized linear models (GLMs), naive vectorization can destroy their multi-way structure and lead to high-dimensional, ill-posed estimation. To address this challenge, Low Separation Rank (LSR) decompositions reduce model complexity by imposing low-rank multilinear structure on the coefficient tensor. A representative approach for estimating LSR-based tensor GLMs (LSR-TGLMs) is the Low Separation Rank Tensor Regression (LSRTR) algorithm, which adopts block coordinate descent and enforces orthogonality of the factor matrices through repeated QR-based projections. However, the repeated projection steps can be computationally demanding and slow convergence. Motivated by the need for scalable estimation and classification from such data, we propose LSRTR-M, which incorporates Muon (MomentUm Orthogonalized by Newton-Schulz) updates into the LSRTR framework. Specifically, LSRTR-M preserves the original block coordinate scheme while replacing the projection-based factor updates with Muon steps. Across synthetic linear, logistic, and Poisson LSR-TGLMs, LSRTR-M converges faster in both iteration count and wall-clock time, while achieving lower normalized estimation and prediction errors. On the Vessel MNIST 3D task, it further improves computational efficiency while maintaining competitive classification performance.