Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–10 of 10 results for author: Rimanic, L

Searching in archive cs. Search in all archives.
.
  1. arXiv:2206.01299  [pdf, other

    cs.LG cs.DC

    Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees

    Authors: Jue Wang, Binhang Yuan, Luka Rimanic, Yongjun He, Tri Dao, Beidi Chen, Christopher Re, Ce Zhang

    Abstract: Communication compression is a crucial technique for modern distributed learning systems to alleviate their communication bottlenecks over slower networks. Despite recent intensive studies of gradient compression for data parallel-style training, compressing the activations for models trained with pipeline parallelism is still an open problem. In this paper, we propose AC-SGD, a novel activation c… ▽ More

    Submitted 6 March, 2023; v1 submitted 2 June, 2022; originally announced June 2022.

  2. arXiv:2204.01457  [pdf, other

    cs.LG cs.DB

    SHiFT: An Efficient, Flexible Search Engine for Transfer Learning

    Authors: Cedric Renggli, Xiaozhe Yao, Luka Kolar, Luka Rimanic, Ana Klimovic, Ce Zhang

    Abstract: Transfer learning can be seen as a data- and compute-efficient alternative to training models from scratch. The emergence of rich model repositories, such as TensorFlow Hub, enables practitioners and researchers to unleash the potential of these models across a wide range of downstream tasks. As these repositories keep growing exponentially, efficiently selecting a good model for the task at hand… ▽ More

    Submitted 28 September, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

  3. arXiv:2108.13034  [pdf, other

    cs.LG

    Evaluating Bayes Error Estimators on Real-World Datasets with FeeBee

    Authors: Cedric Renggli, Luka Rimanic, Nora Hollenstein, Ce Zhang

    Abstract: The Bayes error rate (BER) is a fundamental concept in machine learning that quantifies the best possible accuracy any classifier can achieve on a fixed probability distribution. Despite years of research on building estimators of lower and upper bounds for the BER, these were usually compared only on synthetic datasets with known probability distributions, leaving two key questions unanswered: (1… ▽ More

    Submitted 5 November, 2021; v1 submitted 30 August, 2021; originally announced August 2021.

    Comments: arXiv admin note: text overlap with arXiv:2010.08410

  4. arXiv:2106.06235  [pdf, other

    cs.LG

    Knowledge Enhanced Machine Learning Pipeline against Diverse Adversarial Attacks

    Authors: Nezihe Merve Gürel, Xiangyu Qi, Luka Rimanic, Ce Zhang, Bo Li

    Abstract: Despite the great successes achieved by deep neural networks (DNNs), recent studies show that they are vulnerable against adversarial examples, which aim to mislead DNNs by adding small adversarial perturbations. Several defenses have been proposed against such attacks, while many of them have been adaptively attacked. In this work, we aim to enhance the ML robustness from a different perspective… ▽ More

    Submitted 15 March, 2022; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: International Conference on Machine Learning 2021, 37 pages, 8 figures, 9 tables

  5. DataLens: Scalable Privacy Preserving Training via Gradient Compression and Aggregation

    Authors: Boxin Wang, Fan Wu, Yunhui Long, Luka Rimanic, Ce Zhang, Bo Li

    Abstract: Recent success of deep neural networks (DNNs) hinges on the availability of large-scale dataset; however, training on such dataset often poses privacy risks for sensitive training information. In this paper, we aim to explore the power of generative models and gradient sparsity, and propose a scalable privacy-preserving generative model DATALENS. Comparing with the standard PATE privacy-preserving… ▽ More

    Submitted 27 March, 2022; v1 submitted 20 March, 2021; originally announced March 2021.

    Comments: Accepted to ACM CCS 2021. 23 pages, 4 figures, 12 tables

  6. arXiv:2102.07750  [pdf, other

    cs.LG cs.DB

    A Data Quality-Driven View of MLOps

    Authors: Cedric Renggli, Luka Rimanic, Nezihe Merve Gürel, Bojan Karlaš, Wentao Wu, Ce Zhang

    Abstract: Developing machine learning models can be seen as a process similar to the one established for traditional software development. A key difference between the two lies in the strong dependency between the quality of a machine learning model and the quality of the data used to train or perform evaluations. In this work, we demonstrate how different aspects of data quality propagate through various s… ▽ More

    Submitted 15 February, 2021; originally announced February 2021.

  7. arXiv:2010.08410  [pdf, other

    cs.LG

    Automatic Feasibility Study via Data Quality Analysis for ML: A Case-Study on Label Noise

    Authors: Cedric Renggli, Luka Rimanic, Luka Kolar, Wentao Wu, Ce Zhang

    Abstract: In our experience of working with domain experts who are using today's AutoML systems, a common problem we encountered is what we call "unrealistic expectations" -- when users are facing a very challenging task with a noisy data acquisition process, while being expected to achieve startlingly high accuracy with machine learning (ML). Many of these are predestined to fail from the beginning. In tra… ▽ More

    Submitted 30 August, 2022; v1 submitted 16 October, 2020; originally announced October 2020.

  8. arXiv:2010.07765  [pdf, other

    cs.LG stat.ML

    On Convergence of Nearest Neighbor Classifiers over Feature Transformations

    Authors: Luka Rimanic, Cedric Renggli, Bo Li, Ce Zhang

    Abstract: The k-Nearest Neighbors (kNN) classifier is a fundamental non-parametric machine learning algorithm. However, it is well known that it suffers from the curse of dimensionality, which is why in practice one often applies a kNN classifier on top of a (pre-trained) feature transformation. From a theoretical perspective, most, if not all theoretical results aimed at understanding the kNN classifier ar… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

  9. arXiv:2010.06402  [pdf, other

    cs.LG cs.CV

    Which Model to Transfer? Finding the Needle in the Growing Haystack

    Authors: Cedric Renggli, André Susano Pinto, Luka Rimanic, Joan Puigcerver, Carlos Riquelme, Ce Zhang, Mario Lucic

    Abstract: Transfer learning has been recently popularized as a data-efficient alternative to training models from scratch, in particular for computer vision tasks where it provides a remarkably solid baseline. The emergence of rich model repositories, such as TensorFlow Hub, enables the practitioners and researchers to unleash the potential of these models across a wide range of downstream tasks. As these r… ▽ More

    Submitted 25 March, 2022; v1 submitted 13 October, 2020; originally announced October 2020.

  10. arXiv:2002.12398  [pdf, other

    cs.LG cs.CV stat.ML

    TSS: Transformation-Specific Smoothing for Robustness Certification

    Authors: Linyi Li, Maurice Weber, Xiaojun Xu, Luka Rimanic, Bhavya Kailkhura, Tao Xie, Ce Zhang, Bo Li

    Abstract: As machine learning (ML) systems become pervasive, safeguarding their security is critical. However, recently it has been demonstrated that motivated adversaries are able to mislead ML systems by perturbing test data using semantic transformations. While there exists a rich body of research providing provable robustness guarantees for ML models against $\ell_p$ norm bounded adversarial perturbatio… ▽ More

    Submitted 16 November, 2021; v1 submitted 27 February, 2020; originally announced February 2020.

    Comments: 2021 ACM SIGSAC Conference on Computer and Communications Security (CCS '21)