Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–13 of 13 results for author: Fallah, A

Searching in archive stat. Search in all archives.
.
  1. arXiv:2106.13756  [pdf, other

    cs.LG cs.CR math.OC stat.ML

    Private Adaptive Gradient Methods for Convex Optimization

    Authors: Hilal Asi, John Duchi, Alireza Fallah, Omid Javidbakht, Kunal Talwar

    Abstract: We study adaptive methods for differentially private convex optimization, proposing and analyzing differentially private variants of a Stochastic Gradient Descent (SGD) algorithm with adaptive stepsizes, as well as the AdaGrad algorithm. We provide upper bounds on the regret of both algorithms and show that the bounds are (worst-case) optimal. As a consequence of our development, we show that our… ▽ More

    Submitted 25 June, 2021; originally announced June 2021.

    Comments: To appear in 38th International Conference on Machine Learning (ICML 2021)

  2. arXiv:2106.07537  [pdf, other

    stat.ML cs.LG math.OC

    A Wasserstein Minimax Framework for Mixed Linear Regression

    Authors: Theo Diamandis, Yonina C. Eldar, Alireza Fallah, Farzan Farnia, Asuman Ozdaglar

    Abstract: Multi-modal distributions are commonly used to model clustered data in statistical learning tasks. In this paper, we consider the Mixed Linear Regression (MLR) problem. We propose an optimal transport-based framework for MLR problems, Wasserstein Mixed Linear Regression (WMLR), which minimizes the Wasserstein distance between the learned and target mixture regression models. Through a model-based… ▽ More

    Submitted 16 June, 2021; v1 submitted 14 June, 2021; originally announced June 2021.

    Comments: To appear in 38th International Conference on Machine Learning (ICML 2021)

  3. arXiv:2105.09893  [pdf, other

    stat.ME stat.AP

    A flexible Bayesian non-confounding spatial model for analysis of dispersed count data in clinical studies

    Authors: Mahsa Nadifar, Hossein Baghishani, Afshin Fallah

    Abstract: In employing spatial regression models for counts, we usually meet two issues. First, ignoring the inherent collinearity between covariates and the spatial effect would lead to causal inferences. Second, real count data usually reveal over or under-dispersion where the classical Poisson model is not appropriate to use. We propose a flexible Bayesian hierarchical modeling approach by joining non-co… ▽ More

    Submitted 20 May, 2021; originally announced May 2021.

    Comments: arXiv admin note: text overlap with arXiv:1908.02344

  4. arXiv:2105.08686  [pdf, other

    stat.ME stat.AP

    Flexible Bayesian Modeling of Counts: Constructing Penalized Complexity Priors

    Authors: Mahsa Nadifar, Hossein Baghishani, Thomas Kneib, Afshin Fallah

    Abstract: Many of the data, particularly in medicine and disease mapping are count. Indeed, the under or overdispersion problem in count data distrusts the performance of the classical Poisson model. For taking into account this problem, in this paper, we introduce a new Bayesian structured additive regression model, called gamma count, with enough flexibility in modeling dispersion. Setting convenient prio… ▽ More

    Submitted 18 May, 2021; originally announced May 2021.

  5. arXiv:2102.03832  [pdf, other

    cs.LG math.OC stat.ML

    Generalization of Model-Agnostic Meta-Learning Algorithms: Recurring and Unseen Tasks

    Authors: Alireza Fallah, Aryan Mokhtari, Asuman Ozdaglar

    Abstract: In this paper, we study the generalization properties of Model-Agnostic Meta-Learning (MAML) algorithms for supervised learning problems. We focus on the setting in which we train the MAML model over $m$ tasks, each with $n$ data points, and characterize its generalization error from two points of view: First, we assume the new task at test time is one of the training tasks, and we show that, for… ▽ More

    Submitted 16 November, 2021; v1 submitted 7 February, 2021; originally announced February 2021.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  6. arXiv:2002.07948  [pdf, other

    cs.LG math.OC stat.ML

    Personalized Federated Learning: A Meta-Learning Approach

    Authors: Alireza Fallah, Aryan Mokhtari, Asuman Ozdaglar

    Abstract: In Federated Learning, we aim to train models across multiple computing units (users), while users can only communicate with a common central server, without exchanging their data samples. This mechanism exploits the computational power of all users and allows users to obtain a richer model as their models are trained over a larger set of data points. However, this scheme only develops a common ou… ▽ More

    Submitted 22 October, 2020; v1 submitted 18 February, 2020; originally announced February 2020.

    Comments: To appear in 34th Conference on Neural Information Processing Systems (NeurIPS 2020)

  7. arXiv:2002.05683  [pdf, ps, other

    math.OC cs.LG stat.ML

    An Optimal Multistage Stochastic Gradient Method for Minimax Problems

    Authors: Alireza Fallah, Asuman Ozdaglar, Sarath Pattathil

    Abstract: In this paper, we study the minimax optimization problem in the smooth and strongly convex-strongly concave setting when we have access to noisy estimates of gradients. In particular, we first analyze the stochastic Gradient Descent Ascent (GDA) method with constant stepsize, and show that it converges to a neighborhood of the solution of the minimax problem. We further provide tight bounds on the… ▽ More

    Submitted 13 February, 2020; originally announced February 2020.

  8. arXiv:2002.05135  [pdf, other

    cs.LG math.OC stat.ML

    On the Convergence Theory of Debiased Model-Agnostic Meta-Reinforcement Learning

    Authors: Alireza Fallah, Kristian Georgiev, Aryan Mokhtari, Asuman Ozdaglar

    Abstract: We consider Model-Agnostic Meta-Learning (MAML) methods for Reinforcement Learning (RL) problems, where the goal is to find a policy using data from several tasks represented by Markov Decision Processes (MDPs) that can be updated by one step of stochastic policy gradient for the realized MDP. In particular, using stochastic gradients in MAML update steps is crucial for RL problems since computati… ▽ More

    Submitted 16 November, 2021; v1 submitted 12 February, 2020; originally announced February 2020.

    Comments: 35th Conference on Neural Information Processing Systems (NeurIPS 2021)

  9. arXiv:1910.08701  [pdf, other

    math.OC cs.LG stat.ML

    Robust Distributed Accelerated Stochastic Gradient Methods for Multi-Agent Networks

    Authors: Alireza Fallah, Mert Gurbuzbalaban, Asuman Ozdaglar, Umut Simsekli, Lingjiong Zhu

    Abstract: We study distributed stochastic gradient (D-SG) method and its accelerated variant (D-ASG) for solving decentralized strongly convex stochastic optimization problems where the objective function is distributed over several computational units, lying on a fixed but arbitrary connected communication graph, subject to local communication constraints where noisy estimates of the gradients are availabl… ▽ More

    Submitted 4 October, 2021; v1 submitted 19 October, 2019; originally announced October 2019.

  10. arXiv:1908.10400  [pdf, other

    cs.LG math.OC stat.ML

    On the Convergence Theory of Gradient-Based Model-Agnostic Meta-Learning Algorithms

    Authors: Alireza Fallah, Aryan Mokhtari, Asuman Ozdaglar

    Abstract: We study the convergence of a class of gradient-based Model-Agnostic Meta-Learning (MAML) methods and characterize their overall complexity as well as their best achievable accuracy in terms of gradient norm for nonconvex loss functions. We start with the MAML method and its first-order approximation (FO-MAML) and highlight the challenges that emerge in their analysis. By overcoming these challeng… ▽ More

    Submitted 15 May, 2020; v1 submitted 27 August, 2019; originally announced August 2019.

    Comments: To appear in the proceedings of the $23^{rd}$ International Conference on Artificial Intelligence and Statistics (AISTATS) 2020

  11. arXiv:1908.02344  [pdf, other

    stat.ME stat.AP

    Statistical modeling of groundwater quality assessment in Iran using a flexible Poisson likelihood

    Authors: Mahsa Nadifar, Hossein Baghishani, Afshin Fallah, Havard Rue

    Abstract: Assessing water quality and recognizing its associated risks to human health and the broader environment is undoubtedly essential. Groundwater is widely used to supply water for drinking, industry, and agriculture purposes. The groundwater quality measurements vary for different climates and various human behaviors, and consequently, their spatial variability can be substantial. In this paper, we… ▽ More

    Submitted 6 August, 2019; originally announced August 2019.

    Comments: 24 pages, 6 figures

  12. arXiv:1901.08022  [pdf, other

    math.OC cs.LG stat.ML

    A Universally Optimal Multistage Accelerated Stochastic Gradient Method

    Authors: Necdet Serhat Aybat, Alireza Fallah, Mert Gurbuzbalaban, Asuman Ozdaglar

    Abstract: We study the problem of minimizing a strongly convex, smooth function when we have noisy estimates of its gradient. We propose a novel multistage accelerated algorithm that is universally optimal in the sense that it achieves the optimal rate both in the deterministic and stochastic case and operates without knowledge of noise characteristics. The algorithm consists of stages that use a stochastic… ▽ More

    Submitted 27 October, 2019; v1 submitted 23 January, 2019; originally announced January 2019.

    Comments: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)

  13. arXiv:1805.10579  [pdf, other

    math.OC cs.LG stat.ML

    Robust Accelerated Gradient Methods for Smooth Strongly Convex Functions

    Authors: Necdet Serhat Aybat, Alireza Fallah, Mert Gurbuzbalaban, Asuman Ozdaglar

    Abstract: We study the trade-offs between convergence rate and robustness to gradient errors in designing a first-order algorithm. We focus on gradient descent (GD) and accelerated gradient (AG) methods for minimizing strongly convex functions when the gradient has random errors in the form of additive white noise. With gradient errors, the function values of the iterates need not converge to the optimal va… ▽ More

    Submitted 5 November, 2019; v1 submitted 27 May, 2018; originally announced May 2018.

    Comments: To appear in SIAM Journal on Optimization (SIOPT)