Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–10 of 10 results for author: Pillutla, K

Searching in archive stat. Search in all archives.
.
  1. arXiv:2310.13863  [pdf, other

    stat.ML cs.LG math.OC

    Distributionally Robust Optimization with Bias and Variance Reduction

    Authors: Ronak Mehta, Vincent Roulet, Krishna Pillutla, Zaid Harchaoui

    Abstract: We consider the distributionally robust optimization (DRO) problem with spectral risk-based uncertainty set and $f$-divergence penalty. This formulation includes common risk-sensitive learning objectives such as regularized condition value-at-risk (CVaR) and average top-$k$ loss. We present Prospect, a stochastic gradient-based algorithm that only requires tuning a single learning rate hyperparame… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

  2. arXiv:2212.05149  [pdf, other

    stat.ML cs.LG math.OC

    Stochastic Optimization for Spectral Risk Measures

    Authors: Ronak Mehta, Vincent Roulet, Krishna Pillutla, Lang Liu, Zaid Harchaoui

    Abstract: Spectral risk objectives - also called $L$-risks - allow for learning systems to interpolate between optimizing average-case performance (as in empirical risk minimization) and worst-case performance on a task. We develop stochastic algorithms to optimize these quantities by characterizing their subdifferential and addressing challenges such as biasedness of subgradient estimates and non-smoothnes… ▽ More

    Submitted 9 December, 2022; originally announced December 2022.

  3. arXiv:2212.04014  [pdf, other

    stat.ML cs.LG math.ST

    Statistical and Computational Guarantees for Influence Diagnostics

    Authors: Jillian Fisher, Lang Liu, Krishna Pillutla, Yejin Choi, Zaid Harchaoui

    Abstract: Influence diagnostics such as influence functions and approximate maximum influence perturbations are popular in machine learning and in AI domain applications. Influence diagnostics are powerful statistical tools to identify influential datapoints or subsets of datapoints. We establish finite-sample statistical bounds, as well as computational complexity bounds, for influence functions and approx… ▽ More

    Submitted 19 September, 2023; v1 submitted 7 December, 2022; originally announced December 2022.

    Comments: For AISTATS 2023. Software see https://github.com/jfisher52/influence_theory

  4. arXiv:2112.09429  [pdf, other

    cs.LG math.OC stat.ML

    Federated Learning with Superquantile Aggregation for Heterogeneous Data

    Authors: Krishna Pillutla, Yassine Laguel, Jérôme Malick, Zaid Harchaoui

    Abstract: We present a federated learning framework that is designed to robustly deliver good predictive performance across individual clients with heterogeneous data. The proposed approach hinges upon a superquantile-based learning objective that captures the tail statistics of the error distribution over heterogeneous clients. We present a stochastic training algorithm that interleaves differentially priv… ▽ More

    Submitted 6 December, 2022; v1 submitted 17 December, 2021; originally announced December 2021.

    Comments: Machine Learning Journal, Special Issue on Safe and Fair Machine Learning (To appear)

    Journal ref: Machine Learning (2023): 1-68

  5. arXiv:2106.07898  [pdf, other

    stat.ML cs.LG

    Divergence Frontiers for Generative Models: Sample Complexity, Quantization Effects, and Frontier Integrals

    Authors: Lang Liu, Krishna Pillutla, Sean Welleck, Sewoong Oh, Yejin Choi, Zaid Harchaoui

    Abstract: The spectacular success of deep generative models calls for quantitative tools to measure their statistical performance. Divergence frontiers have recently been proposed as an evaluation framework for generative models, due to their ability to measure the quality-diversity trade-off inherent to deep generative modeling. We establish non-asymptotic bounds on the sample complexity of divergence fron… ▽ More

    Submitted 11 December, 2021; v1 submitted 15 June, 2021; originally announced June 2021.

  6. arXiv:2002.11223  [pdf, other

    stat.ML cs.DC cs.LG math.OC

    Device Heterogeneity in Federated Learning: A Superquantile Approach

    Authors: Yassine Laguel, Krishna Pillutla, Jérôme Malick, Zaid Harchaoui

    Abstract: We propose a federated learning framework to handle heterogeneous client devices which do not conform to the population data distribution. The approach hinges upon a parameterized superquantile-based objective, where the parameter ranges over levels of conformity. We present an optimization algorithm and establish its convergence to a stationary point. We show how to practically implement it using… ▽ More

    Submitted 25 February, 2020; originally announced February 2020.

    Journal ref: Machine Learning (2023): 1-68

  7. arXiv:1912.13445  [pdf, other

    stat.ML cs.CR cs.LG

    Robust Aggregation for Federated Learning

    Authors: Krishna Pillutla, Sham M. Kakade, Zaid Harchaoui

    Abstract: Federated learning is the centralized training of statistical models from decentralized data on mobile devices while preserving the privacy of each device. We present a robust aggregation approach to make federated learning robust to settings when a fraction of the devices may be sending corrupted updates to the server. The approach relies on a robust aggregation oracle based on the geometric medi… ▽ More

    Submitted 17 January, 2022; v1 submitted 31 December, 2019; originally announced December 2019.

    Journal ref: IEEE Transactions on Signal Processing 70 (2022): 1142-1154

  8. arXiv:1902.03228  [pdf, other

    stat.ML cs.LG math.OC

    A Smoother Way to Train Structured Prediction Models

    Authors: Krishna Pillutla, Vincent Roulet, Sham M. Kakade, Zaid Harchaoui

    Abstract: We present a framework to train a structured prediction model by performing smoothing on the inference algorithm it builds upon. Smoothing overcomes the non-smoothness inherent to the maximum margin structured prediction objective, and paves the way for the use of fast primal gradient-based optimization algorithms. We illustrate the proposed framework by developing a novel primal incremental optim… ▽ More

    Submitted 8 February, 2019; originally announced February 2019.

    Comments: Short version appeared in Neural Information Processing Systems (NeurIPS) 2018

  9. arXiv:1710.09430  [pdf, ps, other

    stat.ML cs.LG math.OC

    A Markov Chain Theory Approach to Characterizing the Minimax Optimality of Stochastic Gradient Descent (for Least Squares)

    Authors: Prateek Jain, Sham M. Kakade, Rahul Kidambi, Praneeth Netrapalli, Venkata Krishna Pillutla, Aaron Sidford

    Abstract: This work provides a simplified proof of the statistical minimax optimality of (iterate averaged) stochastic gradient descent (SGD), for the special case of least squares. This result is obtained by analyzing SGD as a stochastic process and by sharply characterizing the stationary covariance matrix of this process. The finite rate optimality characterization captures the constant factors and addre… ▽ More

    Submitted 21 July, 2018; v1 submitted 25 October, 2017; originally announced October 2017.

    Comments: Lemma 1 has been updated in v2

  10. arXiv:1512.04848  [pdf, other

    cs.LG cs.DS stat.ML

    Data Driven Resource Allocation for Distributed Learning

    Authors: Travis Dick, Mu Li, Venkata Krishna Pillutla, Colin White, Maria Florina Balcan, Alex Smola

    Abstract: In distributed machine learning, data is dispatched to multiple machines for processing. Motivated by the fact that similar data points often belong to the same or similar classes, and more generally, classification rules of high accuracy tend to be "locally simple but globally complex" (Vapnik & Bottou 1993), we propose data dependent dispatching that takes advantage of such structure. We present… ▽ More

    Submitted 15 December, 2016; v1 submitted 15 December, 2015; originally announced December 2015.