Showing 1–41 of 41 results for author: Talwalkar, A

Searching in archive stat.

  1. arXiv:2310.02246  [pdf, other]

    cs.LG cs.AI math.NA stat.ML

    Learning to Relax: Setting Solver Parameters Across a Sequence of Linear System Instances

    Authors: Mikhail Khodak, Edmond Chow, Maria-Florina Balcan, Ameet Talwalkar

    Abstract: Solving a linear system $Ax=b$ is a fundamental scientific computing primitive for which numerous solvers and preconditioners have been developed. These come with parameters whose optimal values depend on the system being solved and are often impossible or too expensive to identify; thus in practice sub-optimal heuristics are used. We consider the common setting in which many related linear system…

    Submitted 2 May, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 Spotlight

  2. arXiv:2210.03324  [pdf, other]

    cs.LG cs.AI stat.ML

    AutoML for Climate Change: A Call to Action

    Authors: Renbo Tu, Nicholas Roberts, Vishak Prasad, Sibasis Nayak, Paarth Jain, Frederic Sala, Ganesh Ramakrishnan, Ameet Talwalkar, Willie Neiswanger, Colin White

    Abstract: The challenge that climate change poses to humanity has spurred a rapidly developing field of artificial intelligence research focused on climate change applications. The climate change AI (CCAI) community works on a diverse, challenging set of problems which often involve physics-constrained ML or heterogeneous spatiotemporal data. It would be desirable to use automated machine learning (AutoML)…

    Submitted 7 October, 2022; originally announced October 2022.

  3. arXiv:2207.10199  [pdf, other]

    cs.LG stat.ML

    Provably tuning the ElasticNet across instances

    Authors: Maria-Florina Balcan, Mikhail Khodak, Dravyansh Sharma, Ameet Talwalkar

    Abstract: An important unresolved challenge in the theory of regularization is to set the regularization coefficients of popular techniques like the ElasticNet with general provable guarantees. We consider the problem of tuning the regularization parameters of Ridge regression, LASSO, and the ElasticNet across multiple problem instances, a setting that encompasses both cross-validation and multi-task hyperp…

    Submitted 15 January, 2024; v1 submitted 20 July, 2022; originally announced July 2022.

  4. arXiv:2202.09312  [pdf, other]

    cs.LG cs.AI cs.DS stat.ML

    Learning Predictions for Algorithms with Predictions

    Authors: Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar, Sergei Vassilvitskii

    Abstract: A burgeoning paradigm in algorithm design is the field of algorithms with predictions, in which algorithms can take advantage of a possibly-imperfect prediction of some aspect of the problem. While much work has focused on using predictions to improve competitive ratios, running times, or other performance measures, less effort has been devoted to the question of how to obtain the predictions them…

    Submitted 17 October, 2022; v1 submitted 18 February, 2022; originally announced February 2022.

    Comments: NeurIPS 2022 camera-ready

  5. arXiv:2106.04502  [pdf, other]

    cs.LG cs.AI cs.DC stat.ML

    Federated Hyperparameter Tuning: Challenges, Baselines, and Connections to Weight-Sharing

    Authors: Mikhail Khodak, Renbo Tu, Tian Li, Liam Li, Maria-Florina Balcan, Virginia Smith, Ameet Talwalkar

    Abstract: Tuning hyperparameters is a crucial but arduous part of the machine learning pipeline. Hyperparameter optimization is even more challenging in federated learning, where models are learned over a distributed network of heterogeneous devices; here, the need to keep data on device and perform local training makes it difficult to efficiently train and evaluate configurations. In this work, we investig…

    Submitted 4 November, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021

  6. arXiv:2103.15798  [pdf, other]

    cs.LG cs.AI cs.CV math.NA stat.ML

    Rethinking Neural Operations for Diverse Tasks

    Authors: Nicholas Roberts, Mikhail Khodak, Tri Dao, Liam Li, Christopher Ré, Ameet Talwalkar

    Abstract: An important goal of AutoML is to automate-away the design of neural networks on new tasks in under-explored domains. Motivated by this goal, we study the problem of enabling users to discover the right neural operations given data from their specific domain. We introduce a search space of operations called XD-Operations that mimic the inductive bias of standard multi-channel convolutions while be…

    Submitted 4 November, 2021; v1 submitted 29 March, 2021; originally announced March 2021.

    Comments: NeurIPS 2021

  7. arXiv:2103.00065  [pdf, other]

    cs.LG stat.ML

    Gradient Descent on Neural Networks Typically Occurs at the Edge of Stability

    Authors: Jeremy M. Cohen, Simran Kaur, Yuanzhi Li, J. Zico Kolter, Ameet Talwalkar

    Abstract: We empirically demonstrate that full-batch gradient descent on neural network training objectives typically operates in a regime we call the Edge of Stability. In this regime, the maximum eigenvalue of the training loss Hessian hovers just above the numerical value $2 / \text{(step size)}$, and the training loss behaves non-monotonically over short timescales, yet consistently decreases over long…

    Submitted 23 November, 2022; v1 submitted 26 February, 2021; originally announced March 2021.

    Comments: ICLR 2021. v3 moves several figures from the appendix into the main text, and adds more discussion regarding Jastrzębski et al (2020): https://doi.org/10.48550/arXiv.2002.09572

  8. arXiv:2102.00127  [pdf, other]

    cs.LG cs.AI stat.ML

    On Data Efficiency of Meta-learning

    Authors: Maruan Al-Shedivat, Liam Li, Eric Xing, Ameet Talwalkar

    Abstract: Meta-learning has enabled learning statistical models that can be quickly adapted to new prediction tasks. Motivated by use-cases in personalized federated learning, we study the often overlooked aspect of the modern meta-learning algorithms -- their data efficiency. To shed more light on which methods are more efficient, we use techniques from algorithmic stability to derive bounds on the transfe…

    Submitted 29 January, 2021; originally announced February 2021.

    Comments: Preliminary version. An updated version is to appear in AISTATS 2021

  9. arXiv:2011.01205  [pdf, other]

    cs.LG stat.ML

    A Learning Theoretic Perspective on Local Explainability

    Authors: Jeffrey Li, Vaishnavh Nagarajan, Gregory Plumb, Ameet Talwalkar

    Abstract: In this paper, we explore connections between interpretable machine learning and learning theory through the lens of local approximation explanations. First, we tackle the traditional problem of performance generalization and bound the test-time accuracy of a model using a notion of how locally explainable it is. Second, we explore the novel problem of explanation generalization which is an import…

    Submitted 2 November, 2020; originally announced November 2020.

  10. arXiv:2004.07802  [pdf, other]

    cs.LG cs.CV cs.NE math.OC stat.ML

    Geometry-Aware Gradient Algorithms for Neural Architecture Search

    Authors: Liam Li, Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

    Abstract: Recent state-of-the-art methods for neural architecture search (NAS) exploit gradient-based optimization by relaxing the problem into continuous optimization over architectures and shared-weights, a noisy process that remains poorly understood. We argue for the study of single-level empirical risk minimization to understand NAS with weight-sharing, reducing the design of NAS methods to devising op…

    Submitted 18 March, 2021; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: ICLR 2021 Camera-Ready

  11. arXiv:2004.03424  [pdf, other]

    cs.LG cs.CY stat.ML

    FACT: A Diagnostic for Group Fairness Trade-offs

    Authors: Joon Sik Kim, Jiahao Chen, Ameet Talwalkar

    Abstract: Group fairness, a class of fairness notions that measure how different groups of individuals are treated differently according to their protected attributes, has been shown to conflict with one another, often with a necessary cost in loss of model's predictive performance. We propose a general diagnostic that enables systematic characterization of these trade-offs in group fairness. We observe tha…

    Submitted 7 July, 2020; v1 submitted 7 April, 2020; originally announced April 2020.

    Comments: Accepted to International Conference on Machine Learning (ICML 2020)

  12. arXiv:2003.01640  [pdf, other]

    cs.LG stat.ML

    Explaining Groups of Points in Low-Dimensional Representations

    Authors: Gregory Plumb, Jonathan Terhorst, Sriram Sankararaman, Ameet Talwalkar

    Abstract: A common workflow in data exploration is to learn a low-dimensional representation of the data, identify groups of points in that representation, and examine the differences between the groups to determine what they represent. We treat this workflow as an interpretable machine learning problem by leveraging the model that learned the low-dimensional representation to help identify the key differen…

    Submitted 14 August, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

  13. arXiv:2001.01920  [pdf, other]

    cs.LG stat.ML

    FedDANE: A Federated Newton-Type Method

    Authors: Tian Li, Anit Kumar Sahu, Manzil Zaheer, Maziar Sanjabi, Ameet Talwalkar, Virginia Smith

    Abstract: Federated learning aims to jointly learn statistical models over massively distributed remote devices. In this work, we propose FedDANE, an optimization method that we adapt from DANE, a method for classical distributed optimization, to handle the practical constraints of federated learning. We provide convergence guarantees for this method when learning over both convex and non-convex functions.…

    Submitted 7 January, 2020; originally announced January 2020.

    Comments: Asilomar Conference on Signals, Systems, and Computers 2019

  14. arXiv:1909.05830  [pdf, other]

    cs.LG cs.AI cs.CR stat.ML

    Differentially Private Meta-Learning

    Authors: Jeffrey Li, Mikhail Khodak, Sebastian Caldas, Ameet Talwalkar

    Abstract: Parameter-transfer is a well-known and versatile approach for meta-learning, with applications including few-shot learning, federated learning, and reinforcement learning. However, parameter-transfer algorithms often require sharing models that have been trained on the samples from specific tasks, thus leaving the task-owners susceptible to breaches of privacy. We conduct the first formal study of…

    Submitted 21 February, 2020; v1 submitted 12 September, 2019; originally announced September 2019.

  15. arXiv:1908.07873  [pdf, other]

    cs.LG cs.DC stat.ML

    Federated Learning: Challenges, Methods, and Future Directions

    Authors: Tian Li, Anit Kumar Sahu, Ameet Talwalkar, Virginia Smith

    Abstract: Federated learning involves training statistical models over remote devices or siloed data centers, such as mobile phones or hospitals, while keeping data localized. Training in heterogeneous and potentially massive networks introduces novel challenges that require a fundamental departure from standard approaches for large-scale machine learning, distributed optimization, and privacy-preserving da…

    Submitted 21 August, 2019; originally announced August 2019.

  16. arXiv:1906.11813  [pdf, ps, other]

    cs.LG stat.ML

    Learning Fair Representations for Kernel Models

    Authors: Zilong Tan, Samuel Yeom, Matt Fredrikson, Ameet Talwalkar

    Abstract: Fair representations are a powerful tool for establishing criteria like statistical parity, proxy non-discrimination, and equality of opportunity in learned models. Existing techniques for learning these representations are typically model-agnostic, as they preprocess the original data such that the output satisfies some fairness criterion, and can be used with arbitrary learning methods. In contr…

    Submitted 20 January, 2020; v1 submitted 27 June, 2019; originally announced June 2019.

    Comments: The 23rd International Conference on Artificial Intelligence and Statistics (AISTATS 2020)

  17. arXiv:1906.02717  [pdf, other]

    cs.LG cs.AI stat.ML

    Adaptive Gradient-Based Meta-Learning Methods

    Authors: Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

    Abstract: We build a theoretical framework for designing and understanding practical meta-learning methods that integrates sophisticated formalizations of task-similarity with the extensive literature on online convex optimization and sequential prediction algorithms. Our approach enables the task-similarity to be learned adaptively, provides sharper transfer-risk bounds in the setting of statistical learni…

    Submitted 6 December, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

    Comments: NeurIPS 2019

  18. arXiv:1906.01431  [pdf, other]

    cs.LG stat.ML

    Regularizing Black-box Models for Improved Interpretability (HILL 2019 Version)

    Authors: Gregory Plumb, Maruan Al-Shedivat, Eric Xing, Ameet Talwalkar

    Abstract: Most of the work on interpretable machine learning has focused on designing either inherently interpretable models, which typically trade-off accuracy for interpretability, or post-hoc explanation systems, which lack guarantees about their explanation quality. We propose an alternative to these approaches by directly regularizing a black-box model for interpretability at training time. Our approac…

    Submitted 31 May, 2019; originally announced June 2019.

    Comments: presented at 2019 ICML Workshop on Human in the Loop Learning (HILL 2019), Long Beach, USA. arXiv admin note: substantial text overlap with arXiv:1902.06787

  19. arXiv:1904.03257  [pdf, ps, other]

    cs.LG cs.DB cs.DC cs.SE stat.ML

    MLSys: The New Frontier of Machine Learning Systems

    Authors: Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Jennifer Chayes, Eric Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim Hazelwood, et al. (44 additional authors not shown)

    Abstract: Machine learning (ML) techniques are enjoying rapidly increasing adoption. However, designing and implementing the systems that support ML models in real-world deployments remains a significant obstacle, in large part due to the radically different development and deployment profile of modern ML methods, and the range of practical concerns that come with broader adoption. We propose to foster a ne…

    Submitted 1 December, 2019; v1 submitted 29 March, 2019; originally announced April 2019.

  20. arXiv:1903.05176  [pdf, other]

    cs.LG stat.ML

    Exploiting Reuse in Pipeline-Aware Hyperparameter Tuning

    Authors: Liam Li, Evan Sparks, Kevin Jamieson, Ameet Talwalkar

    Abstract: Hyperparameter tuning of multi-stage pipelines introduces a significant computational burden. Motivated by the observation that work can be reused across pipelines if the intermediate computations are the same, we propose a pipeline-aware approach to hyperparameter tuning. Our approach optimizes both the design and execution of pipelines to maximize reuse. We design pipelines amenable for reuse by…

    Submitted 12 March, 2019; originally announced March 2019.

  21. arXiv:1902.11175  [pdf, other]

    cs.LG stat.ML

    One-Shot Federated Learning

    Authors: Neel Guha, Ameet Talwalkar, Virginia Smith

    Abstract: We present one-shot federated learning, where a central server learns a global model over a network of federated devices in a single round of communication. Our approach - drawing on ensemble learning and knowledge aggregation - achieves an average relative gain of 51.5% in AUC over local baselines and comes within 90.1% of the (unattainable) global ideal. We discuss these methods and identify sev…

    Submitted 5 March, 2019; v1 submitted 28 February, 2019; originally announced February 2019.

    Comments: 5 pages, 3 figures, 1 table. 2nd Workshop on Machine Learning on the Phone and other Consumer Devices, NeurIPS 2018

  22. arXiv:1902.10644  [pdf, other]

    cs.LG cs.AI stat.ML

    Provable Guarantees for Gradient-Based Meta-Learning

    Authors: Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

    Abstract: We study the problem of meta-learning through the lens of online convex optimization, developing a meta-algorithm bridging the gap between popular gradient-based meta-learning and classical regularization-based multi-task transfer methods. Our method is the first to simultaneously satisfy good sample efficiency guarantees in the convex setting, with generalization bounds that improve with task-sim…

    Submitted 16 May, 2019; v1 submitted 27 February, 2019; originally announced February 2019.

    Comments: ICML 2019

  23. arXiv:1902.07638  [pdf, other]

    cs.LG stat.ML

    Random Search and Reproducibility for Neural Architecture Search

    Authors: Liam Li, Ameet Talwalkar

    Abstract: Neural architecture search (NAS) is a promising research direction that has the potential to replace expert-designed networks with learned, task-specific architectures. In this work, in order to help ground the empirical results in this field, we propose new NAS baselines that build off the following observations: (i) NAS is a specialized hyperparameter optimization problem; and (ii) random search…

    Submitted 30 July, 2019; v1 submitted 20 February, 2019; originally announced February 2019.

    Comments: v2 changelog: modified footnote 2 for ENAS; expanded broad reproducibility study for random search with WS for CNN to 6 sets of random seeds. v3 changelog: added journal reference; updated acknowledgements.

    Journal ref: Conference on Uncertainty in Artificial Intelligence (UAI), 2019

  24. arXiv:1902.06787  [pdf, other]

    cs.LG stat.ML

    Regularizing Black-box Models for Improved Interpretability

    Authors: Gregory Plumb, Maruan Al-Shedivat, Angel Alexander Cabrera, Adam Perer, Eric Xing, Ameet Talwalkar

    Abstract: Most of the work on interpretable machine learning has focused on designing either inherently interpretable models, which typically trade-off accuracy for interpretability, or post-hoc explanation systems, whose explanation quality can be unpredictable. Our method, ExpO, is a hybridization of these approaches that regularizes a model for explanation quality at training time. Importantly, these reg…

    Submitted 8 November, 2020; v1 submitted 18 February, 2019; originally announced February 2019.

  25. arXiv:1812.07210  [pdf, other]

    cs.LG cs.DC stat.ML

    Expanding the Reach of Federated Learning by Reducing Client Resource Requirements

    Authors: Sebastian Caldas, Jakub Konečný, H. Brendan McMahan, Ameet Talwalkar

    Abstract: Communication on heterogeneous edge networks is a fundamental bottleneck in Federated Learning (FL), restricting both model capacity and user participation. To address this issue, we introduce two novel strategies to reduce communication costs: (1) the use of lossy compression on the global model sent server-to-client; and (2) Federated Dropout, which allows users to efficiently train locally on s…

    Submitted 8 January, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

  26. arXiv:1812.06127  [pdf, other]

    cs.LG stat.ML

    Federated Optimization in Heterogeneous Networks

    Authors: Tian Li, Anit Kumar Sahu, Manzil Zaheer, Maziar Sanjabi, Ameet Talwalkar, Virginia Smith

    Abstract: Federated Learning is a distributed learning paradigm with two key challenges that differentiate it from traditional distributed optimization: (1) significant variability in terms of the systems characteristics on each device in the network (systems heterogeneity), and (2) non-identically distributed data across the network (statistical heterogeneity). In this work, we introduce a framework, FedPr…

    Submitted 21 April, 2020; v1 submitted 14 December, 2018; originally announced December 2018.

    Comments: MLSys 2020

  27. arXiv:1812.01097  [pdf, other]

    cs.LG stat.ML

    LEAF: A Benchmark for Federated Settings

    Authors: Sebastian Caldas, Sai Meher Karthik Duddu, Peter Wu, Tian Li, Jakub Konečný, H. Brendan McMahan, Virginia Smith, Ameet Talwalkar

    Abstract: Modern federated networks, such as those comprised of wearable devices, mobile phones, or autonomous vehicles, generate massive amounts of data each day. This wealth of data can help to learn models that can improve the user experience on each device. However, the scale and heterogeneity of federated data presents new challenges in research areas such as federated learning, meta-learning, and mult…

    Submitted 9 December, 2019; v1 submitted 3 December, 2018; originally announced December 2018.

  28. arXiv:1810.05934  [pdf, other]

    cs.LG stat.ML

    A System for Massively Parallel Hyperparameter Tuning

    Authors: Liam Li, Kevin Jamieson, Afshin Rostamizadeh, Ekaterina Gonina, Moritz Hardt, Benjamin Recht, Ameet Talwalkar

    Abstract: Modern learning models are characterized by large hyperparameter spaces and long training times. These properties, coupled with the rise of parallel computing and the growing demand to productionize machine learning workloads, motivate the need to develop mature hyperparameter optimization functionality in distributed computing settings. We address this challenge by first introducing a simple and…

    Submitted 15 March, 2020; v1 submitted 13 October, 2018; originally announced October 2018.

    Comments: v2: Corrected typo in Algorithm 1. v3: Added comparison to BOHB and parallel version of synchronous SHA; added PBT to experiment in Section 4.3.1. v4: Added acknowledgements and slight edit to related work.

    Journal ref: Conference on Machine Learning and Systems 2020

  29. arXiv:1807.02910  [pdf, other]

    cs.LG stat.ML

    Model Agnostic Supervised Local Explanations

    Authors: Gregory Plumb, Denali Molitor, Ameet Talwalkar

    Abstract: Model interpretability is an increasingly important component of practical machine learning. Some of the most common forms of interpretability systems are example-based, local, and global explanations. One of the main challenges in interpretability is designing explanation systems that can capture aspects of each of these explanation types, in order to develop a more thorough understanding of the…

    Submitted 5 January, 2019; v1 submitted 8 July, 2018; originally announced July 2018.

  30. arXiv:1707.00424  [pdf, other]

    cs.LG cs.DC stat.ML

    Parle: parallelizing stochastic gradient descent

    Authors: Pratik Chaudhari, Carlo Baldassi, Riccardo Zecchina, Stefano Soatto, Ameet Talwalkar, Adam Oberman

    Abstract: We propose a new algorithm called Parle for parallel training of deep networks that converges 2-4x faster than a data-parallel implementation of SGD, while achieving significantly improved error rates that are nearly state-of-the-art on several benchmarks including CIFAR-10 and CIFAR-100, without introducing any additional hyper-parameters. We exploit the phenomenon of flat minima that has been sh…

    Submitted 10 September, 2017; v1 submitted 3 July, 2017; originally announced July 2017.

  31. arXiv:1705.10467  [pdf, other]

    cs.LG stat.ML

    Federated Multi-Task Learning

    Authors: Virginia Smith, Chao-Kai Chiang, Maziar Sanjabi, Ameet Talwalkar

    Abstract: Federated learning poses new statistical and systems challenges in training machine learning models over distributed networks of devices. In this work, we show that multi-task learning is naturally suited to handle the statistical challenges of this setting, and propose a novel systems-aware optimization method, MOCHA, that is robust to practical systems issues. Our method and theory for the first…

    Submitted 27 February, 2018; v1 submitted 30 May, 2017; originally announced May 2017.

  32. arXiv:1603.06560  [pdf, other]

    cs.LG stat.ML

    Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization

    Authors: Lisha Li, Kevin Jamieson, Giulia DeSalvo, Afshin Rostamizadeh, Ameet Talwalkar

    Abstract: Performance of machine learning algorithms depends critically on identifying a good set of hyperparameters. While recent approaches use Bayesian optimization to adaptively select configurations, we focus on speeding up random search through adaptive resource allocation and early-stopping. We formulate hyperparameter optimization as a pure-exploration non-stochastic infinite-armed bandit problem wh…

    Submitted 18 June, 2018; v1 submitted 21 March, 2016; originally announced March 2016.

    Comments: Changes: updated to JMLR version

    Journal ref: Journal of Machine Learning Research 18 (2018) 1-52

  33. arXiv:1505.06807  [pdf, other]

    cs.LG cs.DC cs.MS stat.ML

    MLlib: Machine Learning in Apache Spark

    Authors: Xiangrui Meng, Joseph Bradley, Burak Yavuz, Evan Sparks, Shivaram Venkataraman, Davies Liu, Jeremy Freeman, DB Tsai, Manish Amde, Sean Owen, Doris Xin, Reynold Xin, Michael J. Franklin, Reza Zadeh, Matei Zaharia, Ameet Talwalkar

    Abstract: Apache Spark is a popular open-source platform for large-scale data processing that is well-suited for iterative machine learning tasks. In this paper we present MLlib, Spark's open-source distributed machine learning library. MLlib provides efficient functionality for a wide range of learning settings and includes several underlying statistical, optimization, and linear algebra primitives. Shippe…

    Submitted 26 May, 2015; originally announced May 2015.

  34. arXiv:1502.07943  [pdf, other]

    cs.LG stat.ML

    Non-stochastic Best Arm Identification and Hyperparameter Optimization

    Authors: Kevin Jamieson, Ameet Talwalkar

    Abstract: Motivated by the task of hyperparameter optimization, we introduce the non-stochastic best-arm identification problem. Within the multi-armed bandit literature, the cumulative regret objective enjoys algorithms and analyses for both the non-stochastic and stochastic settings while to the best of our knowledge, the best-arm identification framework has only been considered in the stochastic setting…

    Submitted 27 February, 2015; originally announced February 2015.

  35. arXiv:1408.2044  [pdf]

    cs.LG stat.ML

    Matrix Coherence and the Nystrom Method

    Authors: Ameet Talwalkar, Afshin Rostamizadeh

    Abstract: The Nystrom method is an efficient technique used to speed up large-scale learning applications by generating low-rank approximations. Crucial to the performance of this technique is the assumption that a matrix can be well approximated by working exclusively with a subset of its columns. In this work we relate this assumption to the concept of matrix coherence, connecting coherence to the perform…

    Submitted 9 August, 2014; originally announced August 2014.

    Comments: Appears in Proceedings of the Twenty-Sixth Conference on Uncertainty in Artificial Intelligence (UAI2010)

    Report number: UAI-P-2010-PG-572-579

  36. arXiv:1310.5426  [pdf, other]

    cs.LG cs.DC stat.ML

    MLI: An API for Distributed Machine Learning

    Authors: Evan R. Sparks, Ameet Talwalkar, Virginia Smith, Jey Kottalam, Xinghao Pan, Joseph Gonzalez, Michael J. Franklin, Michael I. Jordan, Tim Kraska

    Abstract: MLI is an Application Programming Interface designed to address the challenges of building Machine Learning algorithms in a distributed setting based on data-centric computing. Its primary goal is to simplify the development of high-performance, scalable, distributed algorithms. Our initial results show that, relative to existing systems, this interface can be used to build distributed implement…

    Submitted 25 October, 2013; v1 submitted 21 October, 2013; originally announced October 2013.

  37. arXiv:1304.5583  [pdf, ps, other]

    cs.CV cs.DC cs.LG stat.ML

    Distributed Low-rank Subspace Segmentation

    Authors: Ameet Talwalkar, Lester Mackey, Yadong Mu, Shih-Fu Chang, Michael I. Jordan

    Abstract: Vision problems ranging from image clustering to motion segmentation to semi-supervised learning can naturally be framed as subspace segmentation problems, in which one aims to recover multiple low-dimensional subspaces from noisy and corrupted input data. Low-Rank Representation (LRR), a convex formulation of the subspace segmentation problem, is provably and empirically accurate on small problem…

    Submitted 15 October, 2013; v1 submitted 19 April, 2013; originally announced April 2013.

  38. arXiv:1206.6415  [pdf]

    cs.LG stat.ML

    The Big Data Bootstrap

    Authors: Ariel Kleiner, Ameet Talwalkar, Purnamrita Sarkar, Michael Jordan

    Abstract: The bootstrap provides a simple and powerful means of assessing the quality of estimators. However, in settings involving large datasets, the computation of bootstrap-based quantities can be prohibitively demanding. As an alternative, we present the Bag of Little Bootstraps (BLB), a new procedure which incorporates features of both the bootstrap and subsampling to obtain a robust, computationally…

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012). arXiv admin note: text overlap with arXiv:1112.5016

  39. arXiv:1112.5016  [pdf, other]

    stat.ME stat.CO stat.ML

    A Scalable Bootstrap for Massive Data

    Authors: Ariel Kleiner, Ameet Talwalkar, Purnamrita Sarkar, Michael I. Jordan

    Abstract: The bootstrap provides a simple and powerful means of assessing the quality of estimators. However, in settings involving large datasets---which are increasingly prevalent---the computation of bootstrap-based quantities can be prohibitively demanding computationally. While variants such as subsampling and the $m$ out of $n$ bootstrap can be used in principle to reduce the cost of bootstrap computa…

    Submitted 27 June, 2012; v1 submitted 21 December, 2011; originally announced December 2011.

  40. arXiv:1107.0789  [pdf, ps, other]

    cs.LG cs.DS math.NA stat.ML

    Distributed Matrix Completion and Robust Factorization

    Authors: Lester Mackey, Ameet Talwalkar, Michael I. Jordan

    Abstract: If learning methods are to scale to the massive sizes of modern datasets, it is essential for the field of machine learning to embrace parallel and distributed computing. Inspired by the recent development of matrix factorization methods with rich theory but poor computational complexity and by the relative ease of mapping matrices onto distributed architectures, we introduce a scalable divide-and…

    Submitted 28 October, 2013; v1 submitted 5 July, 2011; originally announced July 2011.

    Comments: 35 pages, 6 figures

  41. arXiv:1009.0861  [pdf, ps, other]

    stat.ML cs.AI cs.LG

    On the Estimation of Coherence

    Authors: Mehryar Mohri, Ameet Talwalkar

    Abstract: Low-rank matrix approximations are often used to help scale standard machine learning algorithms to large-scale problems. Recently, matrix coherence has been used to characterize the ability to extract global information from a subset of matrix entries in the context of these low-rank approximations and other sampling-based algorithms, e.g., matrix completion, robust PCA. Since coherence is defi…

    Submitted 4 September, 2010; originally announced September 2010.