Tight bounds for lp oblivious subspace embeddings

Published: 06 January 2019


An lp oblivious subspace embedding is a distribution over r × n matrices Π such that for any fixed n × d matrix A,
$$\Pr_{\Pi}\left[\text{for all } x,\ \|Ax\|_p \le \|\Pi Ax\|_p \le k\,\|Ax\|_p\right] \ge 9/10,$$
where r is the dimension of the embedding, k is the distortion of the embedding, and for an n-dimensional vector y, $\|y\|_p = \left(\sum_{i=1}^{n} |y_i|^p\right)^{1/p}$ is the lp-norm. Another important property is the sparsity of Π, that is, the maximum number of non-zero entries per column, as this determines the running time of computing Π · A. While for p = 2 there are nearly optimal tradeoffs in terms of the dimension, distortion, and sparsity, for the important case of 1 ≤ p < 2, much less was known. In this paper we obtain nearly optimal tradeoffs for lp oblivious subspace embeddings for every 1 ≤ p < 2. Our main results are as follows:
1. We show for every 1 ≤ p < 2, any oblivious subspace embedding with dimension r has distortion k = [MATH HERE]. When r = poly(d) ⩽ n in applications, this gives a $k = \Omega(d^{1/p} \log^{-2/p} d)$ lower bound, and shows the oblivious subspace embedding of Sohler and Woodruff (STOC, 2011) for p = 1 and the oblivious subspace embedding of Meng and Mahoney (STOC, 2013) for 1 < p < 2 are optimal up to poly(log(d)) factors.
2. We give sparse oblivious subspace embeddings for every 1 ≤ p < 2 which are optimal in dimension and distortion, up to poly(log d) factors. Importantly for p = 1, we achieve r = O(d log d), k = O(d log d) and s = O(log d) non-zero entries per column. The best previous construction with s ≤ poly(log d) is due to Woodruff and Zhang (COLT, 2013), giving $k = \Omega(d^2\, \mathrm{poly}(\log d))$ or [MATH HERE] and r ≥ d · poly(log d); in contrast our r = O(d log d) and k = O(d log d) are optimal up to poly(log(d)) factors even for dense matrices.
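To illustrate why the per-column sparsity s governs the cost of computing Π · A, here is a minimal Python sketch. It is not the construction from the paper: the function names `sample_sparse_columns` and `apply_sparse` are hypothetical, and the random ±1 values are an assumption chosen only to show the sparsity pattern, with no lp distortion guarantee implied.

```python
import random

def sample_sparse_columns(r, n, s, rng):
    """For each of the n columns of Pi, pick s distinct rows and a random sign.
    (Illustrative values only; not the distribution used in the paper.)"""
    cols = []
    for _ in range(n):
        rows = rng.sample(range(r), s)
        cols.append([(row, rng.choice((-1.0, 1.0))) for row in rows])
    return cols

def apply_sparse(cols, r, A):
    """Compute Pi @ A, where A is a list of n rows of d floats.
    Column j of Pi only touches row j of A, so each non-zero of A
    is visited s times: total work is O(s * nnz(A))."""
    d = len(A[0])
    out = [[0.0] * d for _ in range(r)]
    for j, row_of_A in enumerate(A):
        for t, a in enumerate(row_of_A):
            if a == 0.0:
                continue  # skip zeros so the cost scales with nnz(A)
            for i, v in cols[j]:
                out[i][t] += v * a
    return out

rng = random.Random(0)
n, d, r, s = 200, 3, 16, 2
A = [[rng.gauss(0, 1) for _ in range(d)] for _ in range(n)]
cols = sample_sparse_columns(r, n, s, rng)
PA = apply_sparse(cols, r, A)  # an r x d matrix
```

Storing Π column by column, as a short list of (row, value) pairs, is what makes the O(s · nnz(A)) bound visible in the loop structure.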
We also give (1) nearly-optimal lp oblivious subspace embeddings with an expected 1 + ε number of non-zero entries per column for arbitrarily small ε > 0, and (2) the first oblivious subspace embeddings for 1 ≤ p < 2 with O(1)-distortion and dimension independent of n. Oblivious subspace embeddings are crucial for distributed and streaming environments, as well as entrywise lp low rank approximation. Our results give improved algorithms for these applications.
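The definition of distortion can be probed numerically. The following Python sketch computes the lp-norm and the range of the ratio ||ΠAx||p / ||Ax||p over random directions x. The helper names (`lp_norm`, `matvec`, `ratio_range`) are hypothetical, and the dense matrix of i.i.d. standard Cauchy entries is an assumption used only as a stand-in Π for p = 1; it is not claimed to be the construction analyzed in the paper. Random sampling only probes the subspace and cannot certify the "for all x" condition.

```python
import math
import random

def lp_norm(y, p):
    """||y||_p = (sum_i |y_i|^p)^(1/p)."""
    return sum(abs(v) ** p for v in y) ** (1.0 / p)

def matvec(M, x):
    """Dense matrix-vector product for a list-of-rows matrix."""
    return [sum(m * xi for m, xi in zip(row, x)) for row in M]

def ratio_range(Pi, A, p, trials, rng):
    """Min and max of ||Pi A x||_p / ||A x||_p over random Gaussian x."""
    d = len(A[0])
    lo, hi = math.inf, 0.0
    for _ in range(trials):
        x = [rng.gauss(0, 1) for _ in range(d)]
        Ax = matvec(A, x)
        ratio = lp_norm(matvec(Pi, Ax), p) / lp_norm(Ax, p)
        lo, hi = min(lo, ratio), max(hi, ratio)
    return lo, hi

rng = random.Random(1)
n, d, r, p = 100, 3, 20, 1.0
A = [[rng.gauss(0, 1) for _ in range(d)] for _ in range(n)]
# Assumption: dense i.i.d. standard Cauchy entries as a stand-in Pi.
Pi = [[math.tan(math.pi * (rng.random() - 0.5)) for _ in range(n)]
      for _ in range(r)]
lo, hi = ratio_range(Pi, A, p, trials=200, rng=rng)
# After rescaling Pi by 1/lo, hi/lo is an empirical lower estimate of k.
print(lo, hi, hi / lo)
```

Because only finitely many x are sampled, hi / lo can only underestimate the true distortion of this particular draw of Π on this A.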


  • (2022) Tight Bounds for ℓ1 Oblivious Subspace Embeddings. ACM Transactions on Algorithms 18:1 (1-32). DOI 10.1145/3477537. Online publication date: 24-Jan-2022
  • (2020) Nearly linear row sampling algorithm for quantile regression. Proceedings of the 37th International Conference on Machine Learning (5979-5989). DOI 10.5555/3524938.3525493. Online publication date: 13-Jul-2020
    Published In

    SODA '19: Proceedings of the Thirtieth Annual ACM-SIAM Symposium on Discrete Algorithms
    January 2019
    2993 pages


    SODA '19: Symposium on Discrete Algorithms
    January 6 - 9, 2019
    San Diego, California

    Overall Acceptance Rate 411 of 1,322 submissions, 31%



