Enayat Ullah

IIT Kanpur, Mathematics and Scientific Computing, Undergraduate

Followers

182

Following

171

Public Views

Uploads

Papers

Communication-efficient distributed SGD with Sketching

arXiv (Cornell University), Mar 12, 2019

Download

Private Stochastic Convex Optimization: Efficient Algorithms for Non-smooth Objectives

arXiv (Cornell University), Feb 21, 2020

Download

Adversarial Robustness is at Odds with Lazy Training

arXiv (Cornell University), Jun 17, 2022

Download

FetchSGD: Communication-Efficient Federated Learning with Sketching

arXiv (Cornell University), Jul 15, 2020

Download

Differentially Private Generalized Linear Models Revisited

arXiv (Cornell University), May 6, 2022

Download

Faster Rates of Convergence to Stationary Points in Differentially Private Optimization

arXiv (Cornell University), Jun 1, 2022

Download

Machine Unlearning via Algorithmic Stability

arXiv (Cornell University), Feb 25, 2021

Download

Communication-Efficient Federated Learning with Sketching

Existing approaches to federated learning suffer from a communication bottleneck as well as conve... more Existing approaches to federated learning suffer from a communication bottleneck as well as convergence issues due to sparse client participation. In this paper we introduce a novel algorithm, called FedSketchedSGD, to overcome these challenges. FedSketchedSGD compresses model updates using a Count Sketch, and then takes advantage of the mergeability of sketches to combine model updates from many workers. A key insight in the design of FedSketchedSGD is that, because the Count Sketch is linear, momentum and error accumulation can both be carried out within the sketch. This allows the algorithm to move momentum and error accumulation from clients to the central aggregator, overcoming the challenges of sparse client participation while still achieving high compression rates. We prove that FedSketchedSGD has favorable convergence guarantees, and we demonstrate its empirical effectiveness by training two residual networks and a transformer model.

Download

Improved Algorithms for Time Decay Streams

ArXiv, 2019

In the time-decay model for data streams, elements of an underlying data set arrive sequentially ... more In the time-decay model for data streams, elements of an underlying data set arrive sequentially with the recently arrived elements being more important. A common approach for handling large data sets is to maintain a \emph{coreset}, a succinct summary of the processed data that allows approximate recovery of a predetermined query. We provide a general framework that takes any offline-coreset and gives a time-decay coreset for polynomial time decay functions. We also consider the exponential time decay model for $k$-median clustering, where we provide a constant factor approximation algorithm that utilizes the online facility location algorithm. Our algorithm stores $\mathcal{O}(k\log(h\Delta)+h)$ points where $h$ is the half-life of the decay function and $\Delta$ is the aspect ratio of the dataset. Our techniques extend to $k$-means clustering and $M$-estimators as well.

Download

Streaming Kernel PCA with Õ(√n) Random Features

ArXiv, 2018

We study the statistical and computational aspects of kernel principal component analysis using r... more We study the statistical and computational aspects of kernel principal component analysis using random Fourier features and show that under mild assumptions, O( √ n log (n)) features suffice to achieve O(1/ ) sample complexity. Furthermore, we give a memory efficient streaming algorithm based on classical Oja’s algorithm that achieves this rate.

Download