Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 53 results for author: Kapralov, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.16701  [pdf, ps, other

    cs.DS

    On the Streaming Complexity of Expander Decomposition

    Authors: Yu Chen, Michael Kapralov, Mikhail Makarov, Davide Mazzali

    Abstract: In this paper we study the problem of finding $(ε, φ)$-expander decompositions of a graph in the streaming model, in particular for dynamic streams of edge insertions and deletions. The goal is to partition the vertex set so that every component induces a $φ$-expander, while the number of inter-cluster edges is only an $ε$ fraction of the total volume. It was recently shown that there exists a sim… ▽ More

    Submitted 29 May, 2024; v1 submitted 25 April, 2024; originally announced April 2024.

  2. arXiv:2402.10806  [pdf, other

    cs.DS

    Streaming Algorithms for Connectivity Augmentation

    Authors: Ce Jin, Michael Kapralov, Sepideh Mahabadi, Ali Vakilian

    Abstract: We study the $k$-connectivity augmentation problem ($k$-CAP) in the single-pass streaming model. Given a $(k-1)$-edge connected graph $G=(V,E)$ that is stored in memory, and a stream of weighted edges $L$ with weights in $\{0,1,\dots,W\}$, the goal is to choose a minimum weight subset $L'\subseteq L$ such that $G'=(V,E\cup L')$ is $k$-edge connected. We give a $(2+ε)$-approximation algorithm for t… ▽ More

    Submitted 16 February, 2024; originally announced February 2024.

  3. arXiv:2402.09707  [pdf, other

    cs.DS

    On the adversarial robustness of Locality-Sensitive Hashing in Hamming space

    Authors: Michael Kapralov, Mikhail Makarov, Christian Sohler

    Abstract: Locality-sensitive hashing~[Indyk,Motwani'98] is a classical data structure for approximate nearest neighbor search. It allows, after a close to linear time preprocessing of the input dataset, to find an approximately nearest neighbor of any fixed query in sublinear time in the dataset size. The resulting data structure is randomized and succeeds with high probability for every fixed query. In m… ▽ More

    Submitted 17 June, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  4. arXiv:2401.02562  [pdf, ps, other

    cs.DS

    A Quasi-Monte Carlo Data Structure for Smooth Kernel Evaluations

    Authors: Moses Charikar, Michael Kapralov, Erik Waingarten

    Abstract: In the kernel density estimation (KDE) problem one is given a kernel $K(x, y)$ and a dataset $P$ of points in a Euclidean space, and must prepare a data structure that can quickly answer density queries: given a point $q$, output a $(1+ε)$-approximation to $μ:=\frac1{|P|}\sum_{p\in P} K(p, q)$. The classical approach to KDE is the celebrated fast multipole method of [Greengard and Rokhlin]. The fa… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  5. arXiv:2211.11384  [pdf, ps, other

    cs.DS

    Expander Decomposition in Dynamic Streams

    Authors: Arnold Filtser, Michael Kapralov, Mikhail Makarov

    Abstract: In this paper we initiate the study of expander decompositions of a graph $G=(V, E)$ in the streaming model of computation. The goal is to find a partitioning $\mathcal{C}$ of vertices $V$ such that the subgraphs of $G$ induced by the clusters $C \in \mathcal{C}$ are good expanders, while the number of intercluster edges is small. Expander decompositions are classically constructed by a recursivel… ▽ More

    Submitted 3 August, 2023; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: 31 pages, 0 figures, ITCS 2023

  6. arXiv:2211.11328  [pdf, ps, other

    cs.DS math.NA

    Toeplitz Low-Rank Approximation with Sublinear Query Complexity

    Authors: Michael Kapralov, Hannah Lawrence, Mikhail Makarov, Cameron Musco, Kshiteej Sheth

    Abstract: We present a sublinear query algorithm for outputting a near-optimal low-rank approximation to any positive semidefinite Toeplitz matrix $T \in \mathbb{R}^{d \times d}$. In particular, for any integer rank $k \leq d$ and $ε,δ> 0$, our algorithm makes $\tilde{O} \left (k^2 \cdot \log(1/δ) \cdot \text{poly}(1/ε) \right )$ queries to the entries of $T$ and outputs a rank… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: Accepted in SODA 2023

  7. arXiv:2209.14775  [pdf, ps, other

    cs.DS

    On Constructing Spanners from Random Gaussian Projections

    Authors: Sepehr Assadi, Michael Kapralov, Huacheng Yu

    Abstract: Graph sketching is a powerful paradigm for analyzing graph structure via linear measurements introduced by Ahn, Guha, and McGregor (SODA'12) that has since found numerous applications in streaming, distributed computing, and massively parallel algorithms, among others. Graph sketching has proven to be quite successful for various problems such as connectivity, minimum spanning trees, edge or verte… ▽ More

    Submitted 29 September, 2022; originally announced September 2022.

  8. arXiv:2207.02581  [pdf, ps, other

    cs.DS

    Learning Hierarchical Structure of Clusterable Graphs

    Authors: Michael Kapralov, Akash Kumar, Silvio Lattanzi, Aida Mousavifar

    Abstract: We consider the problem of learning the hierarchical cluster structure of graphs in the seeded model, where besides the input graph the algorithm is provided with a small number of `seeds', i.e. correctly clustered data points. In particular, we ask whether one can approximate the Dasgupta cost of a graph, a popular measure of hierarchical clusterability, in sublinear time and using a small number… ▽ More

    Submitted 6 July, 2022; originally announced July 2022.

  9. arXiv:2204.09951  [pdf, other

    cs.DS

    Motif Cut Sparsifiers

    Authors: Michael Kapralov, Mikhail Makarov, Sandeep Silwal, Christian Sohler, Jakab Tardos

    Abstract: A motif is a frequently occurring subgraph of a given directed or undirected graph $G$. Motifs capture higher order organizational structure of $G$ beyond edge relationships, and, therefore, have found wide applications such as in graph clustering, community detection, and analysis of biological and physical networks to name a few. In these applications, the cut structure of motifs plays a crucial… ▽ More

    Submitted 12 September, 2022; v1 submitted 21 April, 2022; originally announced April 2022.

    Comments: 48 pages, 3 figures

  10. arXiv:2112.07532  [pdf, ps, other

    cs.DS

    Simulating Random Walks in Random Streams

    Authors: John Kallaugher, Michael Kapralov, Eric Price

    Abstract: The random order graph streaming model has received significant attention recently, with problems such as matching size estimation, component counting, and the evaluation of bounded degree constant query testable properties shown to admit surprisingly space efficient algorithms. The main result of this paper is a space efficient single pass random order streaming algorithm for simulating nearly… ▽ More

    Submitted 14 December, 2021; originally announced December 2021.

  11. arXiv:2112.00655  [pdf, ps, other

    cs.DC cs.DS cs.LG

    Efficient and Local Parallel Random Walks

    Authors: Michael Kapralov, Silvio Lattanzi, Navid Nouri, Jakab Tardos

    Abstract: Random walks are a fundamental primitive used in many machine learning algorithms with several applications in clustering and semi-supervised learning. Despite their relevance, the first efficient parallel algorithm to compute random walks has been introduced very recently (Lacki et al.). Unfortunately their method has a fundamental shortcoming: their algorithm is non-local in that it heavily reli… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

  12. arXiv:2110.10091  [pdf, ps, other

    cs.DS

    Factorial Lower Bounds for (Almost) Random Order Streams

    Authors: Ashish Chiplunkar, John Kallaugher, Michael Kapralov, Eric Price

    Abstract: In this paper we introduce and study the \textsc{StreamingCycles} problem, a random order streaming version of the Boolean Hidden Hypermatching problem that has been instrumental in streaming lower bounds over the past decade. In this problem the edges of a graph $G$, comprising $n/\ell$ disjoint length-$\ell$ cycles on $n$ vertices, are partitioned randomly among $n$ players. Every edge is annota… ▽ More

    Submitted 19 September, 2022; v1 submitted 19 October, 2021; originally announced October 2021.

  13. arXiv:2107.07347  [pdf, other

    cs.DS

    Traversing the FFT Computation Tree for Dimension-Independent Sparse Fourier Transforms

    Authors: Karl Bringmann, Michael Kapralov, Mikhail Makarov, Vasileios Nakos, Amir Yagudin, Amir Zandieh

    Abstract: We consider the well-studied Sparse Fourier transform problem, where one aims to quickly recover an approximately Fourier $k$-sparse vector $\widehat{x} \in \mathbb{C}^{n^d}$ from observing its time domain representation $x$. In the exact $k$-sparse case the best known dimension-independent algorithm runs in near cubic time in $k$ and it is unclear whether a faster algorithm like in low dimensions… ▽ More

    Submitted 22 January, 2023; v1 submitted 15 July, 2021; originally announced July 2021.

  14. arXiv:2107.02578  [pdf, ps, other

    cs.DS

    Noisy Boolean Hidden Matching with Applications

    Authors: Michael Kapralov, Amulya Musipatla, Jakab Tardos, David P. Woodruff, Samson Zhou

    Abstract: The Boolean Hidden Matching (BHM) problem, introduced in a seminal paper of Gavinsky et. al. [STOC'07], has played an important role in the streaming lower bounds for graph problems such as triangle and subgraph counting, maximum matching, MAX-CUT, Schatten $p$-norm approximation, maximum acyclic subgraph, testing bipartiteness, $k$-connectivity, and cycle-freeness. The one-way communication compl… ▽ More

    Submitted 28 January, 2022; v1 submitted 6 July, 2021; originally announced July 2021.

  15. arXiv:2106.02353  [pdf, ps, other

    cs.DS

    Spectral Hypergraph Sparsifiers of Nearly Linear Size

    Authors: Michael Kapralov, Robert Krauthgamer, Jakab Tardos, Yuichi Yoshida

    Abstract: Graph sparsification has been studied extensively over the past two decades, culminating in spectral sparsifiers of optimal size (up to constant factors). Spectral hypergraph sparsification is a natural analogue of this problem, for which optimal bounds on the sparsifier size are not known, mainly because the hypergraph Laplacian is non-linear, and thus lacks the linear-algebraic structure and too… ▽ More

    Submitted 4 June, 2021; originally announced June 2021.

  16. arXiv:2103.11669  [pdf, ps, other

    cs.DS

    Space Lower Bounds for Approximating Maximum Matching in the Edge Arrival Model

    Authors: Michael Kapralov

    Abstract: The bipartite matching problem in the online and streaming settings has received a lot of attention recently. The classical vertex arrival setting, for which the celebrated Karp, Vazirani and Vazirani (KVV) algorithm achieves a $1-1/e$ approximation, is rather well understood: the $1-1/e$ approximation is optimal in both the online and semi-streaming setting, where the algorithm is constrained to… ▽ More

    Submitted 22 March, 2021; originally announced March 2021.

  17. arXiv:2101.05549  [pdf, ps, other

    cs.DS

    Spectral Clustering Oracles in Sublinear Time

    Authors: Grzegorz Gluch, Michael Kapralov, Silvio Lattanzi, Aida Mousavifar, Christian Sohler

    Abstract: Given a graph $G$ that can be partitioned into $k$ disjoint expanders with outer conductance upper bounded by $ε\ll 1$, can we efficiently construct a small space data structure that allows quickly classifying vertices of $G$ according to the expander (cluster) they belong to? Formally, we would like an efficient local computation algorithm that misclassifies at most an $O(ε)$ fraction of vertices… ▽ More

    Submitted 19 October, 2021; v1 submitted 14 January, 2021; originally announced January 2021.

    Comments: Proceedings of the 2021 ACM-SIAM Symposium on Discrete Algorithms (SODA). Society for Industrial and Applied Mathematics, 2021

  18. arXiv:2011.06997  [pdf, ps, other

    cs.DS

    Kernel Density Estimation through Density Constrained Near Neighbor Search

    Authors: Moses Charikar, Michael Kapralov, Navid Nouri, Paris Siminelakis

    Abstract: In this paper we revisit the kernel density estimation problem: given a kernel $K(x, y)$ and a dataset of $n$ points in high dimensional Euclidean space, prepare a data structure that can quickly output, given a query $q$, a $(1+ε)$-approximation to $μ:=\frac1{|P|}\sum_{p\in P} K(p, q)$. First, we give a single data structure based on classical near neighbor search techniques that improves upon or… ▽ More

    Submitted 13 November, 2020; originally announced November 2020.

  19. arXiv:2011.06530  [pdf, ps, other

    cs.DS

    Towards Tight Bounds for Spectral Sparsification of Hypergraphs

    Authors: Michael Kapralov, Robert Krauthgamer, Jakab Tardos, Yuichi Yoshida

    Abstract: Cut and spectral sparsification of graphs have numerous applications, including e.g. speeding up algorithms for cuts and Laplacian solvers. These powerful notions have recently been extended to hypergraphs, which are much richer and may offer new applications. However, the current bounds on the size of hypergraph sparsifiers are not as tight as the corresponding bounds for graphs. Our first resu… ▽ More

    Submitted 12 April, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

  20. arXiv:2011.06481  [pdf, ps, other

    cs.DS

    Communication Efficient Coresets for Maximum Matching

    Authors: Michael Kapralov, Gilbert Maystre, Jakab Tardos

    Abstract: In this paper we revisit the problem of constructing randomized composable coresets for bipartite matching. In this problem the input graph is randomly partitioned across $k$ players, each of which sends a single message to a coordinator, who then must output a good approximation to the maximum matching in the input graph. Assadi and Khanna gave the first such coreset, achieving a $1/9$-approximat… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

  21. arXiv:2007.14204  [pdf, other

    cs.DS

    Graph Spanners by Sketching in Dynamic Streams and the Simultaneous Communication Model

    Authors: Arnold Filtser, Michael Kapralov, Navid Nouri

    Abstract: Graph sketching is a powerful technique introduced by the seminal work of Ahn, Guha and McGregor'12 on connectivity in dynamic graph streams that has enjoyed considerable attention in the literature since then, and has led to near optimal dynamic streaming algorithms for many fundamental problems such as connectivity, cut and spectral sparsifiers and matchings. Interestingly, however, the sketchin… ▽ More

    Submitted 12 November, 2020; v1 submitted 28 July, 2020; originally announced July 2020.

  22. arXiv:2003.09756  [pdf, ps, other

    stat.ML cs.DS cs.LG

    Scaling up Kernel Ridge Regression via Locality Sensitive Hashing

    Authors: Michael Kapralov, Navid Nouri, Ilya Razenshteyn, Ameya Velingker, Amir Zandieh

    Abstract: Random binning features, introduced in the seminal paper of Rahimi and Recht (2007), are an efficient method for approximating a kernel matrix using locality sensitive hashing. Random binning features provide a very simple and efficient way of approximating the Laplace kernel but unfortunately do not apply to many important classes of kernels, notably ones that generate smooth Gaussian processes,… ▽ More

    Submitted 21 March, 2020; originally announced March 2020.

  23. arXiv:1912.12740  [pdf, other

    cs.DC cs.DB cs.DS cs.PF

    Practice of Streaming Processing of Dynamic Graphs: Concepts, Models, and Systems

    Authors: Maciej Besta, Marc Fischer, Vasiliki Kalavri, Michael Kapralov, Torsten Hoefler

    Abstract: Graph processing has become an important part of various areas of computing, including machine learning, medical applications, social network analysis, computational sciences, and others. A growing amount of the associated graph processing workloads are dynamic, with millions of edges added or removed per second. Graph streaming frameworks are specifically crafted to enable the processing of such… ▽ More

    Submitted 27 October, 2021; v1 submitted 29 December, 2019; originally announced December 2019.

    Journal ref: IEEE Transactions on Parallel and Distributed Systems (TPDS), 2022

  24. arXiv:1909.01410  [pdf, ps, other

    cs.DS

    Oblivious Sketching of High-Degree Polynomial Kernels

    Authors: Thomas D. Ahle, Michael Kapralov, Jakob B. T. Knudsen, Rasmus Pagh, Ameya Velingker, David Woodruff, Amir Zandieh

    Abstract: Kernel methods are fundamental tools in machine learning that allow detection of non-linear dependencies between data without explicitly constructing feature vectors in high dimensional spaces. A major disadvantage of kernel methods is their poor scalability: primitives such as kernel PCA or kernel ridge regression generally take prohibitively large quadratic space and (at least) quadratic time, a… ▽ More

    Submitted 22 December, 2020; v1 submitted 3 September, 2019; originally announced September 2019.

  25. arXiv:1907.05725  [pdf, other

    cs.DS

    Space Efficient Approximation to Maximum Matching Size from Uniform Edge Samples

    Authors: Michael Kapralov, Slobodan Mitrović, Ashkan Norouzi-Fard, Jakab Tardos

    Abstract: Given a source of iid samples of edges of an input graph $G$ with $n$ vertices and $m$ edges, how many samples does one need to compute a constant factor approximation to the maximum matching size in $G$? Moreover, is it possible to obtain such an estimate in a small amount of space? We show that, on the one hand, this problem cannot be solved using a nontrivially sublinear (in $m$) number of samp… ▽ More

    Submitted 12 July, 2019; originally announced July 2019.

  26. arXiv:1904.08255  [pdf, ps, other

    cs.DS

    Online Matching with General Arrivals

    Authors: Buddhima Gamlath, Michael Kapralov, Andreas Maggiori, Ola Svensson, David Wajc

    Abstract: The online matching problem was introduced by Karp, Vazirani and Vazirani nearly three decades ago. In that seminal work, they studied this problem in bipartite graphs with vertices arriving only on one side, and presented optimal deterministic and randomized algorithms for this setting. In comparison, more general arrival models, such as edge arrivals and general vertex arrivals, have proven more… ▽ More

    Submitted 17 April, 2019; originally announced April 2019.

  27. arXiv:1903.12165  [pdf, ps, other

    cs.DS

    Faster Spectral Sparsification in Dynamic Streams

    Authors: Michael Kapralov, Aida Mousavifar, Cameron Musco, Christopher Musco, Navid Nouri

    Abstract: Graph sketching has emerged as a powerful technique for processing massive graphs that change over time (i.e., are presented as a dynamic stream of edge updates) over the past few years, starting with the work of Ahn, Guha and McGregor (SODA'12) on graph connectivity via sketching. In this paper we consider the problem of designing spectral approximations to graphs, or spectral sparsifiers, using… ▽ More

    Submitted 28 March, 2019; originally announced March 2019.

  28. arXiv:1903.12150  [pdf, ps, other

    cs.DS

    Dynamic Streaming Spectral Sparsification in Nearly Linear Time and Space

    Authors: Michael Kapralov, Navid Nouri, Aaron Sidford, Jakab Tardos

    Abstract: In this paper we consider the problem of computing spectral approximations to graphs in the single pass dynamic streaming model. We provide a linear sketching based solution that given a stream of edge insertions and deletions to a $n$-node undirected graph, uses $\tilde O(n)$ space, processes each update in $\tilde O(1)$ time, and with high probability recovers a spectral sparsifier in… ▽ More

    Submitted 28 March, 2019; originally announced March 2019.

  29. arXiv:1902.10633  [pdf, ps, other

    cs.DS

    Dimension-independent Sparse Fourier Transform

    Authors: Michael Kapralov, Ameya Velingker, Amir Zandieh

    Abstract: The Discrete Fourier Transform (DFT) is a fundamental computational primitive, and the fastest known algorithm for computing the DFT is the FFT (Fast Fourier Transform) algorithm. One remarkable feature of FFT is the fact that its runtime depends only on the size $N$ of the input vector, but not on the dimensionality of the input domain: FFT runs in time $O(N\log N)$ irrespective of whether the DF… ▽ More

    Submitted 27 February, 2019; originally announced February 2019.

  30. arXiv:1812.08723  [pdf, ps, other

    cs.DS cs.LG eess.SP math.NA

    A Universal Sampling Method for Reconstructing Signals with Simple Fourier Transforms

    Authors: Haim Avron, Michael Kapralov, Cameron Musco, Christopher Musco, Ameya Velingker, Amir Zandieh

    Abstract: Reconstructing continuous signals from a small number of discrete samples is a fundamental problem across science and engineering. In practice, we are often interested in signals with 'simple' Fourier structure, such as bandlimited, multiband, and Fourier sparse signals. More broadly, any prior knowledge about a signal's Fourier power spectrum can constrain its complexity. Intuitively, signals wit… ▽ More

    Submitted 20 December, 2018; originally announced December 2018.

  31. arXiv:1811.10879  [pdf, ps, other

    cs.DS

    An Optimal Space Lower Bound for Approximating MAX-CUT

    Authors: Michael Kapralov, Dmitry Krachun

    Abstract: We consider the problem of estimating the value of MAX-CUT in a graph in the streaming model of computation. At one extreme, there is a trivial $2$-approximation for this problem that uses only $O(\log n)$ space, namely, count the number of edges and output half of this value as the estimate for the size of the MAX-CUT. On the other extreme, for any fixed $ε> 0$, if one allows $\tilde{O}(n)$ space… ▽ More

    Submitted 27 November, 2018; originally announced November 2018.

  32. arXiv:1811.07780  [pdf, ps, other

    cs.DS

    A Simple Sublinear-Time Algorithm for Counting Arbitrary Subgraphs via Edge Sampling

    Authors: Sepehr Assadi, Michael Kapralov, Sanjeev Khanna

    Abstract: In the subgraph counting problem, we are given a input graph $G(V, E)$ and a target graph $H$; the goal is to estimate the number of occurrences of $H$ in $G$. Our focus here is on designing sublinear-time algorithms for approximately counting occurrences of $H$ in $G$ in the setting where the algorithm is given query access to $G$. This problem has been studied in several recent papers which prim… ▽ More

    Submitted 19 November, 2018; originally announced November 2018.

  33. arXiv:1808.04995  [pdf, other

    cs.DS

    The Sketching Complexity of Graph and Hypergraph Counting

    Authors: John Kallaugher, Michael Kapralov, Eric Price

    Abstract: Subgraph counting is a fundamental primitive in graph processing, with applications in social network analysis (e.g., estimating the clustering coefficient of a graph), database processing and other areas. The space complexity of subgraph counting has been studied extensively in the literature, but many natural settings are still not well understood. In this paper we revisit the subgraph (and hype… ▽ More

    Submitted 15 August, 2018; originally announced August 2018.

  34. arXiv:1808.04807  [pdf, ps, other

    cs.DS

    Testing Graph Clusterability: Algorithms and Lower Bounds

    Authors: Ashish Chiplunkar, Michael Kapralov, Sanjeev Khanna, Aida Mousavifar, Yuval Peres

    Abstract: We consider the problem of testing graph cluster structure: given access to a graph $G=(V, E)$, can we quickly determine whether the graph can be partitioned into a few clusters with good inner conductance, or is far from any such graph? This is a generalization of the well-studied problem of testing graph expansion, where one wants to distinguish between the graph having good expansion (i.e.\ bei… ▽ More

    Submitted 18 September, 2018; v1 submitted 14 August, 2018; originally announced August 2018.

    Comments: Appears in FOCS 2018

  35. arXiv:1804.09893  [pdf, other

    cs.LG cs.DS math.NA stat.ML

    Random Fourier Features for Kernel Ridge Regression: Approximation Bounds and Statistical Guarantees

    Authors: Haim Avron, Michael Kapralov, Cameron Musco, Christopher Musco, Ameya Velingker, Amir Zandieh

    Abstract: Random Fourier features is one of the most popular techniques for scaling up kernel methods, such as kernel ridge regression. However, despite impressive empirical results, the statistical properties of random Fourier features are still not well understood. In this paper we take steps toward filling this gap. Specifically, we approach random Fourier features from a spectral matrix approximation po… ▽ More

    Submitted 21 May, 2018; v1 submitted 26 April, 2018; originally announced April 2018.

    Comments: An extended abstract of this work appears in the Proceedings of the 34th International Conference on Machine Learning (ICML 2017)

  36. arXiv:1708.04544  [pdf, ps, other

    cs.DS

    Sample Efficient Estimation and Recovery in Sparse FFT via Isolation on Average

    Authors: Michael Kapralov

    Abstract: The problem of computing the Fourier Transform of a signal whose spectrum is dominated by a small number $k$ of frequencies quickly and using a small number of samples of the signal in time domain (the Sparse FFT problem) has received significant attention recently. It is known how to approximately compute the $k$-sparse Fourier transform in $\approx k\log^2 n$ time [Hassanieh et al'STOC'12], or u… ▽ More

    Submitted 17 August, 2017; v1 submitted 15 August, 2017; originally announced August 2017.

  37. arXiv:1704.00633  [pdf, ps, other

    cs.CC cs.DS

    Optimal lower bounds for universal relation, and for samplers and finding duplicates in streams

    Authors: Michael Kapralov, Jelani Nelson, Jakub Pachocki, Zhengyu Wang, David P. Woodruff, Mobin Yahyazadeh

    Abstract: In the communication problem $\mathbf{UR}$ (universal relation) [KRW95], Alice and Bob respectively receive $x, y \in\{0,1\}^n$ with the promise that $x\neq y$. The last player to receive a message must output an index $i$ such that $x_i\neq y_i$. We prove that the randomized one-way communication complexity of this problem in the public coin model is exactly… ▽ More

    Submitted 3 April, 2017; originally announced April 2017.

    Comments: merge of arXiv:1703.08139 and of work of Kapralov, Woodruff, and Yahyazadeh

  38. arXiv:1702.01286  [pdf, other

    cs.DS

    An Adaptive Sublinear-Time Block Sparse Fourier Transform

    Authors: Volkan Cevher, Michael Kapralov, Jonathan Scarlett, Amir Zandieh

    Abstract: The problem of approximately computing the $k$ dominant Fourier coefficients of a vector $X$ quickly, and using few samples in time domain, is known as the Sparse Fourier Transform (sparse FFT) problem. A long line of work on the sparse FFT has resulted in algorithms with $O(k\log n\log (n/k))$ runtime [Hassanieh et al., STOC'12] and $O(k\log n)$ sample complexity [Indyk et al., FOCS'14]. These re… ▽ More

    Submitted 11 April, 2017; v1 submitted 4 February, 2017; originally announced February 2017.

  39. arXiv:1606.05732  [pdf, other

    cs.DS

    How to Fake Multiply by a Gaussian Matrix

    Authors: Michael Kapralov, Vamsi K. Potluru, David P. Woodruff

    Abstract: Have you ever wanted to multiply an $n \times d$ matrix $X$, with $n \gg d$, on the left by an $m \times n$ matrix $\tilde G$ of i.i.d. Gaussian random variables, but could not afford to do it because it was too slow? In this work we propose a new randomized $m \times n$ matrix $T$, for which one can compute $T \cdot X$ in only $O(\text{nnz}(X)) + \tilde O(m^2 \cdot d^{3})$ time, for which the tot… ▽ More

    Submitted 9 December, 2020; v1 submitted 18 June, 2016; originally announced June 2016.

  40. arXiv:1604.00845  [pdf, ps, other

    cs.DS

    Sparse Fourier Transform in Any Constant Dimension with Nearly-Optimal Sample Complexity in Sublinear Time

    Authors: Michael Kapralov

    Abstract: We consider the problem of computing a $k$-sparse approximation to the Fourier transform of a length $N$ signal. Our main result is a randomized algorithm for computing such an approximation (i.e. achieving the $\ell_2/\ell_2$ sparse recovery guarantees using Fourier measurements) using $O_d(k\log N\log\log N)$ samples of the signal in time domain that runs in time $O_d(k\log^{d+3} N)$, where… ▽ More

    Submitted 4 April, 2016; originally announced April 2016.

  41. arXiv:1602.04478  [pdf, other

    cs.DC cs.DS

    Subgraph Counting: Color Coding Beyond Trees

    Authors: Venkatesan T. Chakaravarthy, Michael Kapralov, Prakash Murali, Fabrizio Petrini, Xinyu Que, Yogish Sabharwal, Baruch Schieber

    Abstract: The problem of counting occurrences of query graphs in a large data graph, known as subgraph counting, is fundamental to several domains such as genomics and social network analysis. Many important special cases (e.g. triangle counting) have received significant attention. Color coding is a very general and powerful algorithmic technique for subgraph counting. Color coding has been shown to be eff… ▽ More

    Submitted 2 April, 2016; v1 submitted 14 February, 2016; originally announced February 2016.

  42. arXiv:1409.2138  [pdf, ps, other

    cs.DS

    Streaming Lower Bounds for Approximating MAX-CUT

    Authors: Michael Kapralov, Sanjeev Khanna, Madhu Sudan

    Abstract: We consider the problem of estimating the value of max cut in a graph in the streaming model of computation. At one extreme, there is a trivial $2$-approximation for this problem that uses only $O(\log n)$ space, namely, count the number of edges and output half of this value as the estimate for max cut value. On the other extreme, if one allows $\tilde{O}(n)$ space, then a near-optimal solution t… ▽ More

    Submitted 7 September, 2014; originally announced September 2014.

  43. arXiv:1407.1289  [pdf, ps, other

    cs.DS

    Single Pass Spectral Sparsification in Dynamic Streams

    Authors: Michael Kapralov, Yin Tat Lee, Cameron Musco, Christopher Musco, Aaron Sidford

    Abstract: We present the first single pass algorithm for computing spectral sparsifiers of graphs in the dynamic semi-streaming model. Given a single pass over a stream containing insertions and deletions of edges to a graph G, our algorithm maintains a randomized linear sketch of the incidence matrix of G into dimension O((1/epsilon^2) n polylog(n)). Using this sketch, at any point, the algorithm can outpu… ▽ More

    Submitted 15 April, 2015; v1 submitted 4 July, 2014; originally announced July 2014.

  44. arXiv:1403.5804  [pdf, other

    cs.DS

    Sample-Optimal Fourier Sampling in Any Constant Dimension -- Part I

    Authors: Piotr Indyk, Michael Kapralov

    Abstract: We give an algorithm for $\ell_2/\ell_2$ sparse recovery from Fourier measurements using $O(k\log N)$ samples, matching the lower bound of \cite{DIPW} for non-adaptive algorithms up to constant factors for any $k\leq N^{1-δ}$. The algorithm runs in $\tilde O(N)$ time. Our algorithm extends to higher dimensions, leading to sample complexity of $O_d(k\log N)$, which is optimal up to constant factors… ▽ More

    Submitted 12 May, 2014; v1 submitted 23 March, 2014; originally announced March 2014.

  45. arXiv:1206.2269  [pdf, ps, other

    cs.DS

    Better bounds for matchings in the streaming model

    Authors: Michael Kapralov

    Abstract: In this paper we present improved bounds for approximating maximum matchings in bipartite graphs in the streaming model. First, we consider the question of how well maximum matching can be approximated in a single pass over the input using $\tilde O(n)$ space, where $n$ is the number of vertices in the input graph. Two natural variants of this problem have been considered in the literature: (1) th… ▽ More

    Submitted 17 March, 2021; v1 submitted 11 June, 2012; originally announced June 2012.

  46. arXiv:1204.1025  [pdf, ps, other

    cs.DS

    Online submodular welfare maximization: Greedy is optimal

    Authors: Michael Kapralov, Ian Post, Jan Vondrak

    Abstract: We prove that no online algorithm (even randomized, against an oblivious adversary) is better than 1/2-competitive for welfare maximization with coverage valuations, unless $NP = RP$. Since the Greedy algorithm is known to be 1/2-competitive for monotone submodular valuations, of which coverage is a special case, this proves that Greedy provides the optimal competitive ratio. On the other hand, we… ▽ More

    Submitted 29 January, 2013; v1 submitted 4 April, 2012; originally announced April 2012.

  47. arXiv:1203.4900  [pdf, ps, other

    cs.DS

    Single pass sparsification in the streaming model with edge deletions

    Authors: Ashish Goel, Michael Kapralov, Ian Post

    Abstract: In this paper we give a construction of cut sparsifiers of Benczur and Karger in the {\em dynamic} streaming setting in a single pass over the data stream. Previous constructions either required multiple passes or were unable to handle edge deletions. We use $\tilde{O}(1/\e^2)$ time for each stream update and $\tilde{O}(n/\e^2)$ time to construct a sparsifier. Our $\e$-sparsifiers have… ▽ More

    Submitted 22 March, 2012; originally announced March 2012.

  48. arXiv:1202.3683  [pdf, ps, other

    cs.NI

    Optimal bandwidth-aware VM allocation for Infrastructure-as-a-Service

    Authors: Debojyoti Dutta, Michael Kapralov, Ian Post, Rajendra Shinde

    Abstract: Infrastructure-as-a-Service (IaaS) providers need to offer richer services to be competitive while optimizing their resource usage to keep costs down. Richer service offerings include new resource request models involving bandwidth guarantees between virtual machines (VMs). Thus we consider the following problem: given a VM request graph (where nodes are VMs and edges represent virtual network con… ▽ More

    Submitted 16 February, 2012; originally announced February 2012.

  49. arXiv:1008.3672  [pdf, ps, other

    cs.DS

    Prediction strategies without loss

    Authors: Michael Kapralov, Rina Panigrahy

    Abstract: Consider a sequence of bits where we are trying to predict the next bit from the previous bits. Assume we are allowed to say 'predict 0' or 'predict 1', and our payoff is +1 if the prediction is correct and -1 otherwise. We will say that at each point in time the loss of an algorithm is the number of wrong predictions minus the number of right predictions so far. In this paper we are interested in… ▽ More

    Submitted 10 October, 2012; v1 submitted 21 August, 2010; originally announced August 2010.

  50. arXiv:1004.4915  [pdf, ps, other

    cs.DS

    Graph Sparsification via Refinement Sampling

    Authors: Ashish Goel, Michael Kapralov, Sanjeev Khanna

    Abstract: A graph G'(V,E') is an \eps-sparsification of G for some \eps>0, if every (weighted) cut in G' is within (1\pm \eps) of the corresponding cut in G. A celebrated result of Benczur and Karger shows that for every undirected graph G, an \eps-sparsification with O(n\log n/\e^2) edges can be constructed in O(m\log^2n) time. Applications to modern massive data sets often constrain algorithms to use co… ▽ More

    Submitted 27 April, 2010; originally announced April 2010.