Showing 1–18 of 18 results for author: Renggli, C
  1. arXiv:2306.11182

    cs.LG cs.DB cs.IR

    Co-design Hardware and Algorithm for Vector Search

    Authors: Wenqi Jiang, Shigang Li, Yu Zhu, Johannes de Fine Licht, Zhenhao He, Runbin Shi, Cedric Renggli, Shuai Zhang, Theodoros Rekatsinas, Torsten Hoefler, Gustavo Alonso

    Abstract: Vector search has emerged as the foundation for large-scale information retrieval and machine learning systems, with search engines like Google and Bing processing tens of thousands of queries per second on petabyte-scale document datasets by evaluating vector similarities between encoded query texts and web documents. As performance demands for vector search systems surge, accelerated hardware of…

    Submitted 6 July, 2023; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 11 pages

  2. arXiv:2206.05830

    cs.LG

    Stochastic Gradient Descent without Full Data Shuffle

    Authors: Lijie Xu, Shuang Qiu, Binhang Yuan, Jiawei Jiang, Cedric Renggli, Shaoduo Gan, Kaan Kara, Guoliang Li, Ji Liu, Wentao Wu, Jieping Ye, Ce Zhang

    Abstract: Stochastic gradient descent (SGD) is the cornerstone of modern machine learning (ML) systems. Despite its computational efficiency, SGD requires random data access that is inherently inefficient when implemented in systems that rely on block-addressable secondary storage such as HDD and SSD, e.g., TensorFlow/PyTorch and in-DB ML systems over large files. To address this impedance mismatch, various…

    Submitted 12 June, 2022; originally announced June 2022.

    Comments: This technical report is an extension of our SIGMOD 2022 paper titled "In-Database Machine Learning with CorgiPile: Stochastic Gradient Descent without Full Data Shuffle". https://doi.org/10.1145/3514221.3526150

  3. arXiv:2204.01457

    cs.LG cs.DB

    SHiFT: An Efficient, Flexible Search Engine for Transfer Learning

    Authors: Cedric Renggli, Xiaozhe Yao, Luka Kolar, Luka Rimanic, Ana Klimovic, Ce Zhang

    Abstract: Transfer learning can be seen as a data- and compute-efficient alternative to training models from scratch. The emergence of rich model repositories, such as TensorFlow Hub, enables practitioners and researchers to unleash the potential of these models across a wide range of downstream tasks. As these repositories keep growing exponentially, efficiently selecting a good model for the task at hand…

    Submitted 28 September, 2022; v1 submitted 4 April, 2022; originally announced April 2022.

  4. arXiv:2203.04428

    cs.CR

    DeepSE-WF: Unified Security Estimation for Website Fingerprinting Defenses

    Authors: Alexander Veicht, Cedric Renggli, Diogo Barradas

    Abstract: Website fingerprinting (WF) attacks, usually conducted with the help of a machine learning-based classifier, enable a network eavesdropper to pinpoint which web page a user is accessing through the inspection of traffic patterns. These attacks have been shown to succeed even when users browse the Internet through encrypted tunnels, e.g., through Tor or VPNs. To assess the security of new defenses…

    Submitted 28 November, 2022; v1 submitted 8 March, 2022; originally announced March 2022.

    Comments: Major revision - added experiments with new dataset and alternative neural network architectures for estimating the BER

  5. arXiv:2202.12015

    cs.CV cs.LG

    Learning to Merge Tokens in Vision Transformers

    Authors: Cedric Renggli, André Susano Pinto, Neil Houlsby, Basil Mustafa, Joan Puigcerver, Carlos Riquelme

    Abstract: Transformers are widely applied to solve natural language understanding and computer vision tasks. While scaling up these architectures leads to improved performance, it often comes at the expense of much higher computational costs. In order for large-scale models to remain practical in real-world systems, there is a need for reducing their computational overhead. In this work, we present the Patc…

    Submitted 24 February, 2022; originally announced February 2022.

    Comments: 11 pages, 9 figures

  6. arXiv:2112.08048

    cs.CL

    Dynamic Human Evaluation for Relative Model Comparisons

    Authors: Thórhildur Thorleiksdóttir, Cedric Renggli, Nora Hollenstein, Ce Zhang

    Abstract: Collecting human judgements is currently the most reliable evaluation method for natural language generation systems. Automatic metrics have reported flaws when applied to measure quality aspects of generated text and have been shown to correlate poorly with human judgements. However, human evaluation is time and cost-intensive, and we lack consensus on designing and conducting human evaluation ex…

    Submitted 28 April, 2022; v1 submitted 15 December, 2021; originally announced December 2021.

    Comments: accepted at LREC 2022

  7. arXiv:2108.13034

    cs.LG

    Evaluating Bayes Error Estimators on Real-World Datasets with FeeBee

    Authors: Cedric Renggli, Luka Rimanic, Nora Hollenstein, Ce Zhang

    Abstract: The Bayes error rate (BER) is a fundamental concept in machine learning that quantifies the best possible accuracy any classifier can achieve on a fixed probability distribution. Despite years of research on building estimators of lower and upper bounds for the BER, these were usually compared only on synthetic datasets with known probability distributions, leaving two key questions unanswered: (1…

    Submitted 5 November, 2021; v1 submitted 30 August, 2021; originally announced August 2021.

    Comments: arXiv admin note: text overlap with arXiv:2010.08410

  8. Decoding EEG Brain Activity for Multi-Modal Natural Language Processing

    Authors: Nora Hollenstein, Cedric Renggli, Benjamin Glaus, Maria Barrett, Marius Troendle, Nicolas Langer, Ce Zhang

    Abstract: Until recently, human behavioral data from reading has mainly been of interest to researchers to understand human cognition. However, these human language processing signals can also be beneficial in machine learning-based natural language processing tasks. Using EEG brain activity to this purpose is largely unexplored as of yet. In this paper, we present the first large-scale study of systematica…

    Submitted 13 July, 2021; v1 submitted 17 February, 2021; originally announced February 2021.

    Journal ref: Frontiers of Human Neuroscience 2021

  9. arXiv:2102.07750

    cs.LG cs.DB

    A Data Quality-Driven View of MLOps

    Authors: Cedric Renggli, Luka Rimanic, Nezihe Merve Gürel, Bojan Karlaš, Wentao Wu, Ce Zhang

    Abstract: Developing machine learning models can be seen as a process similar to the one established for traditional software development. A key difference between the two lies in the strong dependency between the quality of a machine learning model and the quality of the data used to train or perform evaluations. In this work, we demonstrate how different aspects of data quality propagate through various s…

    Submitted 15 February, 2021; originally announced February 2021.

  10. arXiv:2010.08410

    cs.LG

    Automatic Feasibility Study via Data Quality Analysis for ML: A Case-Study on Label Noise

    Authors: Cedric Renggli, Luka Rimanic, Luka Kolar, Wentao Wu, Ce Zhang

    Abstract: In our experience of working with domain experts who are using today's AutoML systems, a common problem we encountered is what we call "unrealistic expectations" -- when users are facing a very challenging task with a noisy data acquisition process, while being expected to achieve startlingly high accuracy with machine learning (ML). Many of these are predestined to fail from the beginning. In tra…

    Submitted 30 August, 2022; v1 submitted 16 October, 2020; originally announced October 2020.

  11. arXiv:2010.07765

    cs.LG stat.ML

    On Convergence of Nearest Neighbor Classifiers over Feature Transformations

    Authors: Luka Rimanic, Cedric Renggli, Bo Li, Ce Zhang

    Abstract: The k-Nearest Neighbors (kNN) classifier is a fundamental non-parametric machine learning algorithm. However, it is well known that it suffers from the curse of dimensionality, which is why in practice one often applies a kNN classifier on top of a (pre-trained) feature transformation. From a theoretical perspective, most, if not all theoretical results aimed at understanding the kNN classifier ar…

    Submitted 15 October, 2020; originally announced October 2020.

  12. arXiv:2010.06402

    cs.LG cs.CV

    Which Model to Transfer? Finding the Needle in the Growing Haystack

    Authors: Cedric Renggli, André Susano Pinto, Luka Rimanic, Joan Puigcerver, Carlos Riquelme, Ce Zhang, Mario Lucic

    Abstract: Transfer learning has been recently popularized as a data-efficient alternative to training models from scratch, in particular for computer vision tasks where it provides a remarkably solid baseline. The emergence of rich model repositories, such as TensorFlow Hub, enables the practitioners and researchers to unleash the potential of these models across a wide range of downstream tasks. As these r…

    Submitted 25 March, 2022; v1 submitted 13 October, 2020; originally announced October 2020.

  13. arXiv:2009.13239

    cs.LG cs.CV stat.ML

    Scalable Transfer Learning with Expert Models

    Authors: Joan Puigcerver, Carlos Riquelme, Basil Mustafa, Cedric Renggli, André Susano Pinto, Sylvain Gelly, Daniel Keysers, Neil Houlsby

    Abstract: Transfer of pre-trained representations can improve sample efficiency and reduce computational requirements for new tasks. However, representations used for transfer are usually generic, and are not tailored to a particular distribution of downstream tasks. We explore the use of expert representations for transfer with a simple, yet effective, strategy. We train a diverse set of experts by exploit…

    Submitted 28 September, 2020; originally announced September 2020.

  14. arXiv:1910.03472

    eess.IV cs.CV cs.LG

    Observer Dependent Lossy Image Compression

    Authors: Maurice Weber, Cedric Renggli, Helmut Grabner, Ce Zhang

    Abstract: Deep neural networks have recently advanced the state-of-the-art in image compression and surpassed many traditional compression algorithms. The training of such networks involves carefully trading off entropy of the latent representation against reconstruction quality. The term quality crucially depends on the observer of the images which, in the vast majority of literature, is assumed to be huma…

    Submitted 2 November, 2020; v1 submitted 8 October, 2019; originally announced October 2019.

    Comments: @German Conference on Pattern Recognition (DAGM GCPR 2020)

  15. arXiv:1903.00278

    cs.LG stat.ML

    Continuous Integration of Machine Learning Models with ease.ml/ci: Towards a Rigorous Yet Practical Treatment

    Authors: Cedric Renggli, Bojan Karlaš, Bolin Ding, Feng Liu, Kevin Schawinski, Wentao Wu, Ce Zhang

    Abstract: Continuous integration is an indispensable step of modern software engineering practices to systematically manage the life cycles of system development. Developing a machine learning model is no difference - it is an engineering process with a life cycle, including design, implementation, tuning, testing, and deployment. However, most, if not all, existing continuous integration engines do not sup…

    Submitted 1 March, 2019; originally announced March 2019.

  16. arXiv:1810.07766

    cs.DC cs.LG

    Distributed Learning over Unreliable Networks

    Authors: Chen Yu, Hanlin Tang, Cedric Renggli, Simon Kassing, Ankit Singla, Dan Alistarh, Ce Zhang, Ji Liu

    Abstract: Most of today's distributed machine learning systems assume reliable networks: whenever two machines exchange information (e.g., gradients or models), the network should guarantee the delivery of the message. At the same time, recent work exhibits the impressive tolerance of machine learning algorithms to errors or noise arising from relaxed communication or synchronization. In this paper, w…

    Submitted 15 May, 2019; v1 submitted 17 October, 2018; originally announced October 2018.

  17. arXiv:1809.10505

    cs.LG cs.DC stat.ML

    The Convergence of Sparsified Gradient Methods

    Authors: Dan Alistarh, Torsten Hoefler, Mikael Johansson, Sarit Khirirat, Nikola Konstantinov, Cédric Renggli

    Abstract: Distributed training of massive machine learning models, in particular deep neural networks, via Stochastic Gradient Descent (SGD) is becoming commonplace. Several families of communication-reduction methods, such as quantization, large-batch methods, and gradient sparsification, have been proposed. To date, gradient sparsification methods - where each node sorts gradients by magnitude, and only c…

    Submitted 27 September, 2018; originally announced September 2018.

    Comments: NIPS 2018 - Advances in Neural Information Processing Systems; Authors in alphabetic order

  18. arXiv:1802.08021

    cs.DC stat.ML

    SparCML: High-Performance Sparse Communication for Machine Learning

    Authors: Cedric Renggli, Saleh Ashkboos, Mehdi Aghagolzadeh, Dan Alistarh, Torsten Hoefler

    Abstract: Applying machine learning techniques to the quickly growing data in science and industry requires highly-scalable algorithms. Large datasets are most commonly processed "data parallel" distributed across many nodes. Each node's contribution to the overall gradient is summed using a global allreduce. This allreduce is the single communication and thus scalability bottleneck for most machine learnin…

    Submitted 16 August, 2019; v1 submitted 22 February, 2018; originally announced February 2018.