We revisit the problem of sketching using approximate leverage scores for matrix least squares problems of the form $|| AX - B ||_F^2$ where the design matrix $A \in \mathbb{R}^{N \times r}$ is tall and skinny with $N \gg r$. We derive the theoretical results from first principles and clarify the relation to previously stated bounds, improving some constants along the way. One can characterize the utility of a sketching scheme according to the number of samples it needs for an $\varepsilon$-accurate solution with high probability. Assuming $\varepsilon$ is suitably small, we will show that approximate leverage score sampling requires $4r/(\beta\delta\varepsilon)$ samples, where $\delta$ is the failure probability and $\beta\in (0,1]$ is a measure of the quality of the approximate leverage scores such that $\beta=1$ corresponds to using exact leverage scores. In cases where a few approximate leverage scores are very large (summing to $p_{\rm det}$), we also show that using a hybrid deterministic and random sampling scheme reduces the required number of samples by a factor of $1/(1-p_{\rm det})$.
Numerical Analysis (math.NA), Data Structures and Algorithms (cs.DS), FOS: Mathematics, FOS: Computer and information sciences
@misc{LaKo26,
author = {Larsen, Brett W. and Kolda, Tamara G.},
title = {Revisiting Approximate Leverage Score Sketching for Matrix Least Squares},
month = {March},
year = {2026},
eprint = {2201.10638},
eprintclass = {math.NA},
}