Revisiting Approximate Leverage Score Sketching for Matrix Least Squares

Abstract

We revisit the problem of sketching using approximate leverage scores for matrix least squares problems of the form $|| AX - B ||_F^2$ where the design matrix $A \in \mathbb{R}^{N \times r}$ is tall and skinny with $N \gg r$. We derive the theoretical results from first principles and clarify the relation to previously stated bounds, improving some constants along the way. One can characterize the utility of a sketching scheme according to the number of samples it needs for an $\varepsilon$-accurate solution with high probability. Assuming $\varepsilon$ is suitably small, we will show that approximate leverage score sampling requires $4r/(\beta\delta\varepsilon)$ samples, where $\delta$ is the failure probability and $\beta\in (0,1]$ is a measure of the quality of the approximate leverage scores such that $\beta=1$ corresponds to using exact leverage scores. In cases where a few approximate leverage scores are very large (summing to $p_{\rm det}$), we also show that using a hybrid deterministic and random sampling scheme reduces the required number of samples by a factor of $1/(1-p_{\rm det})$.

Publication
arXiv
Date
Citation
B. W. Larsen, T. G. Kolda. Revisiting Approximate Leverage Score Sketching for Matrix Least Squares. arXiv:2201.10638, 2026. http://arxiv.org/abs/2201.10638

Keywords

Numerical Analysis (math.NA), Data Structures and Algorithms (cs.DS), FOS: Mathematics, FOS: Computer and information sciences

BibTeX

@misc{LaKo26,  
author = {Larsen, Brett W. and Kolda, Tamara G.}, 
title = {Revisiting Approximate Leverage Score Sketching for Matrix Least Squares}, 
month = {March}, 
year = {2026},
eprint = {2201.10638},
eprintclass = {math.NA},
}