Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 98 results for author: Hein, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.18026  [pdf, other

    eess.IV cs.CV

    Segmentation-guided MRI reconstruction for meaningfully diverse reconstructions

    Authors: Jan Nikolas Morshuis, Matthias Hein, Christian F. Baumgartner

    Abstract: Inverse problems, such as accelerated MRI reconstruction, are ill-posed and an infinite amount of possible and plausible solutions exist. This may not only lead to uncertainty in the reconstructed image but also in downstream tasks such as semantic segmentation. This uncertainty, however, is mostly not analyzed in the literature, even though probabilistic reconstruction models are commonly used. T… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: Accepted at DGM4MICCAI 2024

  2. arXiv:2407.03848  [pdf, other

    cs.LG

    Bias of Stochastic Gradient Descent or the Architecture: Disentangling the Effects of Overparameterization of Neural Networks

    Authors: Amit Peleg, Matthias Hein

    Abstract: Neural networks typically generalize well when fitting the data perfectly, even though they are heavily overparameterized. Many factors have been pointed out as the reason for this phenomenon, including an implicit bias of stochastic gradient descent (SGD) and a possible simplicity bias arising from the neural network architecture. The goal of this paper is to disentangle the factors that influenc… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: Accepted at ICML 2024

  3. arXiv:2407.03224  [pdf

    cs.RO cs.AI eess.SY

    PPO-based Dynamic Control of Uncertain Floating Platforms in the Zero-G Environment

    Authors: Mahya Ramezani, M. Amin Alandihallaj, Andreas M. Hein

    Abstract: In the field of space exploration, floating platforms play a crucial role in scientific investigations and technological advancements. However, controlling these platforms in zero-gravity environments presents unique challenges, including uncertainties and disturbances. This paper introduces an innovative approach that combines Proximal Policy Optimization (PPO) with Model Predictive Control (MPC)… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Pre-print version submitted to 2024 International Conference on Robotics and Automation (ICRA)

  4. arXiv:2405.17447  [pdf, other

    cs.CV cs.LG

    How to train your ViT for OOD Detection

    Authors: Maximilian Mueller, Matthias Hein

    Abstract: VisionTransformers have been shown to be powerful out-of-distribution detectors for ImageNet-scale settings when finetuned from publicly available checkpoints, often outperforming other model types on popular benchmarks. In this work, we investigate the impact of both the pretraining and finetuning scheme on the performance of ViTs on this task by analyzing a large pool of models. We find that the… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: text overlap with arXiv:2306.00826

  5. arXiv:2404.16637  [pdf, other

    cs.CV

    Zero-Shot Distillation for Image Encoders: How to Make Effective Use of Synthetic Data

    Authors: Niclas Popp, Jan Hendrik Metzen, Matthias Hein

    Abstract: Multi-modal foundation models such as CLIP have showcased impressive zero-shot capabilities. However, their applicability in resource-constrained environments is limited due to their large number of parameters and high inference time. While existing approaches have scaled down the entire CLIP architecture, we focus on training smaller variants of the image encoder, which suffices for efficient zer… ▽ More

    Submitted 25 April, 2024; originally announced April 2024.

  6. arXiv:2404.07045  [pdf, other

    cs.CV

    Identification of Fine-grained Systematic Errors via Controlled Scene Generation

    Authors: Valentyn Boreiko, Matthias Hein, Jan Hendrik Metzen

    Abstract: Many safety-critical applications, especially in autonomous driving, require reliable object detectors. They can be very effectively assisted by a method to search for and identify potential failures and systematic errors before these detectors are deployed. Systematic errors are characterized by combinations of attributes such as object location, scale, orientation, and color, as well as the comp… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

  7. arXiv:2402.12336  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models

    Authors: Christian Schlarmann, Naman Deep Singh, Francesco Croce, Matthias Hein

    Abstract: Multi-modal foundation models like OpenFlamingo, LLaVA, and GPT-4 are increasingly used for various real-world tasks. Prior work has shown that these models are highly vulnerable to adversarial attacks on the vision modality. These attacks can be leveraged to spread fake information or defraud users, and thus pose a significant risk, which makes the robustness of large multi-modal foundation model… ▽ More

    Submitted 5 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: ICML 2024 Oral

  8. arXiv:2311.17833  [pdf, other

    cs.CV cs.AI cs.LG

    DiG-IN: Diffusion Guidance for Investigating Networks -- Uncovering Classifier Differences Neuron Visualisations and Visual Counterfactual Explanations

    Authors: Maximilian Augustin, Yannic Neuhaus, Matthias Hein

    Abstract: While deep learning has led to huge progress in complex image classification tasks like ImageNet, unexpected failure modes, e.g. via spurious features, call into question how reliably these classifiers work in the wild. Furthermore, for safety-critical tasks the black-box nature of their decisions is problematic, and explanations or at least methods which make decisions plausible are needed urgent… ▽ More

    Submitted 12 July, 2024; v1 submitted 29 November, 2023; originally announced November 2023.

    Comments: CVPR 2024

  9. arXiv:2311.14450  [pdf, other

    cs.CV cs.CR cs.LG

    Segment (Almost) Nothing: Prompt-Agnostic Adversarial Attacks on Segmentation Models

    Authors: Francesco Croce, Matthias Hein

    Abstract: General purpose segmentation models are able to generate (semantic) segmentation masks from a variety of prompts, including visual (points, boxed, etc.) and textual (object names) ones. In particular, input images are pre-processed by an image encoder to obtain embedding vectors which are later used for mask predictions. Existing adversarial attacks target the end-to-end tasks, i.e. aim at alterin… ▽ More

    Submitted 24 November, 2023; originally announced November 2023.

  10. arXiv:2311.11629  [pdf, other

    cs.CV cs.LG

    Generating Realistic Counterfactuals for Retinal Fundus and OCT Images using Diffusion Models

    Authors: Indu Ilanchezian, Valentyn Boreiko, Laura Kühlewein, Ziwei Huang, Murat Seçkin Ayhan, Matthias Hein, Lisa Koch, Philipp Berens

    Abstract: Counterfactual reasoning is often used in clinical settings to explain decisions or weigh alternatives. Therefore, for imaging based specialties such as ophthalmology, it would be beneficial to be able to create counterfactual images, illustrating answers to questions like "If the subject had had diabetic retinopathy, how would the fundus image have looked?". Here, we demonstrate that using a diff… ▽ More

    Submitted 4 December, 2023; v1 submitted 20 November, 2023; originally announced November 2023.

  11. arXiv:2309.13489  [pdf, other

    cs.CV

    Identifying Systematic Errors in Object Detectors with the SCROD Pipeline

    Authors: Valentyn Boreiko, Matthias Hein, Jan Hendrik Metzen

    Abstract: The identification and removal of systematic errors in object detectors can be a prerequisite for their deployment in safety-critical applications like automated driving and robotics. Such systematic errors can for instance occur under very specific object poses (location, scale, orientation), object colors/textures, and backgrounds. Real images alone are unlikely to cover all relevant combination… ▽ More

    Submitted 23 September, 2023; originally announced September 2023.

  12. arXiv:2308.10741  [pdf, other

    cs.LG cs.AI cs.CR

    On the Adversarial Robustness of Multi-Modal Foundation Models

    Authors: Christian Schlarmann, Matthias Hein

    Abstract: Multi-modal foundation models combining vision and language models such as Flamingo or GPT-4 have recently gained enormous interest. Alignment of foundation models is used to prevent models from providing toxic or harmful output. While malicious users have successfully tried to jailbreak foundation models, an equally important question is if honest users could be harmed by malicious third-party co… ▽ More

    Submitted 21 August, 2023; originally announced August 2023.

    Comments: ICCV AROW 2023

  13. arXiv:2306.12941  [pdf, other

    cs.CV cs.LG

    Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models

    Authors: Francesco Croce, Naman D Singh, Matthias Hein

    Abstract: Adversarial robustness has been studied extensively in image classification, especially for the $\ell_\infty$-threat model, but significantly less so for related tasks such as object detection and semantic segmentation, where attacks turn out to be a much harder optimization problem than for image classification. We propose several problem-specific novel attacks minimizing different metrics in acc… ▽ More

    Submitted 16 July, 2024; v1 submitted 22 June, 2023; originally announced June 2023.

    Comments: ECCV 2024

  14. arXiv:2306.04226  [pdf, other

    cs.LG cs.CV

    Normalization Layers Are All That Sharpness-Aware Minimization Needs

    Authors: Maximilian Mueller, Tiffany Vlaar, David Rolnick, Matthias Hein

    Abstract: Sharpness-aware minimization (SAM) was proposed to reduce sharpness of minima and has been shown to enhance generalization performance in various settings. In this work we show that perturbing only the affine normalization parameters (typically comprising 0.1% of the total parameters) in the adversarial step of SAM can outperform perturbing all of the parameters.This finding generalizes to differe… ▽ More

    Submitted 17 November, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: camera ready version

  15. arXiv:2306.00826  [pdf, other

    cs.LG cs.CV

    In or Out? Fixing ImageNet Out-of-Distribution Detection Evaluation

    Authors: Julian Bitterwolf, Maximilian Müller, Matthias Hein

    Abstract: Out-of-distribution (OOD) detection is the problem of identifying inputs which are unrelated to the in-distribution task. The OOD detection performance when the in-distribution (ID) is ImageNet-1K is commonly being tested on a small range of test OOD datasets. We find that most of the currently used test OOD datasets, including datasets from the open set recognition (OSR) literature, have severe i… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: ICML 2023. Datasets, code and evaluation data at https://github.com/j-cb/NINCO

  16. arXiv:2303.01870  [pdf, other

    cs.CV cs.CR cs.LG

    Revisiting Adversarial Training for ImageNet: Architectures, Training and Generalization across Threat Models

    Authors: Naman D Singh, Francesco Croce, Matthias Hein

    Abstract: While adversarial training has been extensively studied for ResNet architectures and low resolution datasets like CIFAR, much less is known for ImageNet. Given the recent debate about whether transformers are more robust than convnets, we revisit adversarial training on ImageNet comparing ViTs and ConvNeXts. Extensive experiments show that minor changes in architecture, most notably replacing Patc… ▽ More

    Submitted 28 October, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

    Comments: Accepted at NeurIPS 2023

  17. arXiv:2302.07011  [pdf, other

    cs.LG

    A Modern Look at the Relationship between Sharpness and Generalization

    Authors: Maksym Andriushchenko, Francesco Croce, Maximilian Müller, Matthias Hein, Nicolas Flammarion

    Abstract: Sharpness of minima is a promising quantity that can correlate with generalization in deep networks and, when optimized during training, can improve generalization. However, standard sharpness is not invariant under reparametrizations of neural networks, and, to fix this, reparametrization-invariant sharpness definitions have been proposed, most prominently adaptive sharpness (Kwon et al., 2021).… ▽ More

    Submitted 7 June, 2023; v1 submitted 14 February, 2023; originally announced February 2023.

    Comments: The camera-ready version (accepted at ICML 2023)

  18. arXiv:2212.04871  [pdf, other

    cs.CV cs.LG

    Spurious Features Everywhere -- Large-Scale Detection of Harmful Spurious Features in ImageNet

    Authors: Yannic Neuhaus, Maximilian Augustin, Valentyn Boreiko, Matthias Hein

    Abstract: Benchmark performance of deep learning classifiers alone is not a reliable predictor for the performance of a deployed model. In particular, if the image classifier has picked up spurious features in the training data, its predictions can fail in unexpected ways. In this paper, we develop a framework that allows us to systematically identify spurious features in large datasets like ImageNet. It is… ▽ More

    Submitted 22 August, 2023; v1 submitted 9 December, 2022; originally announced December 2022.

    Comments: accepted to ICCV 2023

  19. arXiv:2210.11841  [pdf, other

    cs.CV cs.LG

    Diffusion Visual Counterfactual Explanations

    Authors: Maximilian Augustin, Valentyn Boreiko, Francesco Croce, Matthias Hein

    Abstract: Visual Counterfactual Explanations (VCEs) are an important tool to understand the decisions of an image classifier. They are 'small' but 'realistic' semantic changes of the image changing the classifier decision. Current approaches for the generation of VCEs are restricted to adversarially robust models and often contain non-realistic artefacts, or are limited to image classification problems with… ▽ More

    Submitted 21 October, 2022; originally announced October 2022.

    Comments: NeurIPS 2022

  20. arXiv:2210.10653  [pdf, other

    astro-ph.IM cs.OH physics.space-ph

    Agile Systems Engineering for sub-CubeSat scale spacecraft

    Authors: Konstantinos Kanavouras, Andreas Makoto Hein, Maanasa Sachidanand

    Abstract: Space systems miniaturization has been increasingly popular for the past decades, with over 1600 CubeSats and 300 sub-CubeSat sized spacecraft estimated to have been launched since 1998. This trend towards decreasing size enables the execution of unprecedented missions in terms of quantity, cost and development time, allowing for massively distributed satellite networks, and rapid prototyping of s… ▽ More

    Submitted 14 October, 2022; originally announced October 2022.

    Comments: 15 pages, 6 figures, 3 tables, presented in the 73rd International Astronautical Congress

  21. arXiv:2209.06953  [pdf, other

    cs.CV cs.LG

    On the interplay of adversarial robustness and architecture components: patches, convolution and attention

    Authors: Francesco Croce, Matthias Hein

    Abstract: In recent years novel architecture components for image classification have been developed, starting with attention and patches used in transformers. While prior works have analyzed the influence of some aspects of architecture components on the robustness to adversarial attacks, in particular for vision transformers, the understanding of the main factors is still limited. We compare several (non)… ▽ More

    Submitted 14 September, 2022; originally announced September 2022.

    Comments: Presented at the "New Frontiers in Adversarial Machine Learning" Workshop at ICML 2022

  22. arXiv:2209.05980  [pdf, other

    cs.CV cs.AI cs.CR cs.LG

    Certified Defences Against Adversarial Patch Attacks on Semantic Segmentation

    Authors: Maksym Yatsura, Kaspar Sakmann, N. Grace Hua, Matthias Hein, Jan Hendrik Metzen

    Abstract: Adversarial patch attacks are an emerging security threat for real world deep learning applications. We present Demasked Smoothing, the first approach (up to our knowledge) to certify the robustness of semantic segmentation models against this threat model. Previous work on certifiably defending against patch attacks has mostly focused on image classification task and often required changes in the… ▽ More

    Submitted 21 February, 2023; v1 submitted 13 September, 2022; originally announced September 2022.

    Comments: accepted at ICLR 2023

  23. arXiv:2208.03161  [pdf, ps, other

    eess.IV cs.CV

    Adversarial Robustness of MR Image Reconstruction under Realistic Perturbations

    Authors: Jan Nikolas Morshuis, Sergios Gatidis, Matthias Hein, Christian F. Baumgartner

    Abstract: Deep Learning (DL) methods have shown promising results for solving ill-posed inverse problems such as MR image reconstruction from undersampled $k$-space data. However, these approaches currently have no guarantees for reconstruction quality and the reliability of such algorithms is only poorly understood. Adversarial attacks offer a valuable tool to understand possible failure modes and worst ca… ▽ More

    Submitted 5 August, 2022; originally announced August 2022.

    Comments: Accepted at the MICCAI-2022 workshop: Machine Learning for Medical Image Reconstruction

  24. arXiv:2207.07209  [pdf, ps, other

    cs.LG cs.CR

    Sound Randomized Smoothing in Floating-Point Arithmetics

    Authors: Václav Voráček, Matthias Hein

    Abstract: Randomized smoothing is sound when using infinite precision. However, we show that randomized smoothing is no longer sound for limited floating-point precision. We present a simple example where randomized smoothing certifies a radius of $1.26$ around a point, even though there is an adversarial example in the distance $0.8$ and extend this example further to provide false certificates for CIFAR10… ▽ More

    Submitted 25 April, 2023; v1 submitted 14 July, 2022; originally announced July 2022.

    Comments: Accepted ICLR 2023

  25. arXiv:2207.07208  [pdf, other

    cs.LG

    Provably Adversarially Robust Nearest Prototype Classifiers

    Authors: Václav Voráček, Matthias Hein

    Abstract: Nearest prototype classifiers (NPCs) assign to each input point the label of the nearest prototype with respect to a chosen distance metric. A direct advantage of NPCs is that the decisions are interpretable. Previous work could provide lower bounds on the minimal adversarial perturbation in the $\ell_p$-threat model when using the same $\ell_p$-distance for the NPCs. In this paper we provide a co… ▽ More

    Submitted 14 July, 2022; originally announced July 2022.

    Comments: Accepted at ICML 2022

  26. arXiv:2206.09880  [pdf, ps, other

    cs.LG cs.CV

    Breaking Down Out-of-Distribution Detection: Many Methods Based on OOD Training Data Estimate a Combination of the Same Core Quantities

    Authors: Julian Bitterwolf, Alexander Meinke, Maximilian Augustin, Matthias Hein

    Abstract: It is an important problem in trustworthy machine learning to recognize out-of-distribution (OOD) inputs which are inputs unrelated to the in-distribution task. Many out-of-distribution detection methods have been suggested in recent years. The goal of this paper is to recognize common objectives as well as to identify the implicit scoring functions of different OOD detection methods. We focus on… ▽ More

    Submitted 20 June, 2022; originally announced June 2022.

  27. Sparse Visual Counterfactual Explanations in Image Space

    Authors: Valentyn Boreiko, Maximilian Augustin, Francesco Croce, Philipp Berens, Matthias Hein

    Abstract: Visual counterfactual explanations (VCEs) in image space are an important tool to understand decisions of image classifiers as they show under which changes of the image the decision of the classifier would change. Their generation in image space is challenging and requires robust models due to the problem of adversarial examples. Existing techniques to generate VCEs in image space suffer from spu… ▽ More

    Submitted 29 September, 2022; v1 submitted 16 May, 2022; originally announced May 2022.

    Journal ref: GCPR 2022

  28. arXiv:2202.13711  [pdf, other

    cs.LG cs.CR cs.CV

    Evaluating the Adversarial Robustness of Adaptive Test-time Defenses

    Authors: Francesco Croce, Sven Gowal, Thomas Brunner, Evan Shelhamer, Matthias Hein, Taylan Cemgil

    Abstract: Adaptive defenses, which optimize at test time, promise to improve adversarial robustness. We categorize such adaptive test-time defenses, explain their potential benefits and drawbacks, and evaluate a representative variety of the latest adaptive defenses for image classification. Unfortunately, none significantly improve upon static defenses when subjected to our careful case study evaluation. S… ▽ More

    Submitted 13 July, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

    Comments: ICML'22

  29. arXiv:2111.01714  [pdf, other

    cs.LG cs.AI cs.CV

    Meta-Learning the Search Distribution of Black-Box Random Search Based Adversarial Attacks

    Authors: Maksym Yatsura, Jan Hendrik Metzen, Matthias Hein

    Abstract: Adversarial attacks based on randomized search schemes have obtained state-of-the-art results in black-box robustness evaluation recently. However, as we demonstrate in this work, their efficiency in different query budget regimes depends on manual design and heuristic tuning of the underlying proposal distributions. We study how this issue can be addressed by adapting the proposal distribution on… ▽ More

    Submitted 22 November, 2021; v1 submitted 2 November, 2021; originally announced November 2021.

    Comments: accepted at NeurIPS 2021; updated the numbers in Table 5 and added references; added acknowledgements

  30. arXiv:2106.10065  [pdf, other

    cs.LG stat.ML

    Being a Bit Frequentist Improves Bayesian Neural Networks

    Authors: Agustinus Kristiadi, Matthias Hein, Philipp Hennig

    Abstract: Despite their compelling theoretical properties, Bayesian neural networks (BNNs) tend to perform worse than frequentist methods in classification-based uncertainty quantification (UQ) tasks such as out-of-distribution (OOD) detection. In this paper, based on empirical findings in prior works, we hypothesize that this issue is because even recent Bayesian methods have never considered OOD data in t… ▽ More

    Submitted 2 February, 2022; v1 submitted 18 June, 2021; originally announced June 2021.

    Comments: AISTATS 2022

  31. arXiv:2106.04260  [pdf, other

    cs.LG cs.AI cs.CV

    Provably Robust Detection of Out-of-distribution Data (almost) for free

    Authors: Alexander Meinke, Julian Bitterwolf, Matthias Hein

    Abstract: The application of machine learning in safety-critical systems requires a reliable assessment of uncertainty. However, deep neural networks are known to produce highly overconfident predictions on out-of-distribution (OOD) data. Even if trained to be non-confident on OOD data, one can still adversarially manipulate OOD data so that the classifier again assigns high confidence to the manipulated sa… ▽ More

    Submitted 18 October, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

  32. arXiv:2105.12508  [pdf, other

    cs.LG cs.CR cs.CV

    Adversarial Robustness against Multiple and Single $l_p$-Threat Models via Quick Fine-Tuning of Robust Classifiers

    Authors: Francesco Croce, Matthias Hein

    Abstract: A major drawback of adversarially robust models, in particular for large scale datasets like ImageNet, is the extremely long training time compared to standard ones. Moreover, models should be robust not only to one $l_p$-threat model but ideally to all of them. In this paper we propose Extreme norm Adversarial Training (E-AT) for multiple-norm robustness which is based on geometric properties of… ▽ More

    Submitted 7 August, 2022; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: ICML 2022

  33. arXiv:2104.08323  [pdf, other

    cs.LG cs.AR cs.CR cs.CV

    Random and Adversarial Bit Error Robustness: Energy-Efficient and Secure DNN Accelerators

    Authors: David Stutz, Nandhini Chandramoorthy, Matthias Hein, Bernt Schiele

    Abstract: Deep neural network (DNN) accelerators received considerable attention in recent years due to the potential to save energy compared to mainstream hardware. Low-voltage operation of DNN accelerators allows to further reduce energy consumption, however, causes bit-level failures in the memory storing the quantized weights. Furthermore, DNN accelerators are vulnerable to adversarial attacks on voltag… ▽ More

    Submitted 7 June, 2022; v1 submitted 16 April, 2021; originally announced April 2021.

  34. arXiv:2104.04448  [pdf, other

    cs.LG cs.CV stat.ML

    Relating Adversarially Robust Generalization to Flat Minima

    Authors: David Stutz, Matthias Hein, Bernt Schiele

    Abstract: Adversarial training (AT) has become the de-facto standard to obtain models robust against adversarial examples. However, AT exhibits severe robust overfitting: cross-entropy loss on adversarial examples, so-called robust loss, decreases continuously on training examples, while eventually increasing on test examples. In practice, this leads to poor robust generalization, i.e., adversarial robustne… ▽ More

    Submitted 6 October, 2021; v1 submitted 9 April, 2021; originally announced April 2021.

    Comments: ICCV'21

  35. arXiv:2103.01208  [pdf, other

    cs.LG cs.CV

    Mind the box: $l_1$-APGD for sparse adversarial attacks on image classifiers

    Authors: Francesco Croce, Matthias Hein

    Abstract: We show that when taking into account also the image domain $[0,1]^d$, established $l_1$-projected gradient descent (PGD) attacks are suboptimal as they do not consider that the effective threat model is the intersection of the $l_1$-ball and $[0,1]^d$. We study the expected sparsity of the steepest descent step for this effective threat model and show that the exact projection onto this set is co… ▽ More

    Submitted 24 November, 2023; v1 submitted 1 March, 2021; originally announced March 2021.

    Comments: In ICML 2021. Fixed typos in Eq. (3) and Eq. (4)

  36. arXiv:2012.12372  [pdf, other

    cs.LG cs.CV

    Out-distribution aware Self-training in an Open World Setting

    Authors: Maximilian Augustin, Matthias Hein

    Abstract: Deep Learning heavily depends on large labeled datasets which limits further improvements. While unlabeled data is available in large amounts, in particular in image recognition, it does not fulfill the closed world assumption of semi-supervised learning that all unlabeled data are task-related. The goal of this paper is to leverage unlabeled data in an open world setting to further improve predic… ▽ More

    Submitted 21 December, 2020; originally announced December 2020.

  37. arXiv:2010.09670  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    RobustBench: a standardized adversarial robustness benchmark

    Authors: Francesco Croce, Maksym Andriushchenko, Vikash Sehwag, Edoardo Debenedetti, Nicolas Flammarion, Mung Chiang, Prateek Mittal, Matthias Hein

    Abstract: As a research community, we are still lacking a systematic understanding of the progress on adversarial robustness which often makes it hard to identify the most promising ideas in training robust models. A key challenge in benchmarking robustness is that its evaluation is often error-prone leading to robustness overestimation. Our goal is to establish a standardized benchmark of adversarial robus… ▽ More

    Submitted 31 October, 2021; v1 submitted 19 October, 2020; originally announced October 2020.

    Comments: The camera-ready version accepted at the NeurIPS'21 Datasets and Benchmarks Track: 120+ evaluations, 80+ models, 7 leaderboards (Linf, L2, common corruptions; CIFAR-10, CIFAR-100, ImageNet), significantly expanded analysis part (calibration, fairness, privacy leakage, smoothness, transferability)

  38. arXiv:2010.02720  [pdf, other

    cs.LG

    Learnable Uncertainty under Laplace Approximations

    Authors: Agustinus Kristiadi, Matthias Hein, Philipp Hennig

    Abstract: Laplace approximations are classic, computationally lightweight means for constructing Bayesian neural networks (BNNs). As in other approximate BNNs, one cannot necessarily expect the induced predictive uncertainty to be calibrated. Here we develop a formalism to explicitly "train" the uncertainty in a decoupled way to the prediction itself. To this end, we introduce uncertainty units for Laplace-… ▽ More

    Submitted 7 June, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: UAI 2021

  39. arXiv:2010.02709  [pdf, other

    cs.LG stat.ML

    An Infinite-Feature Extension for Bayesian ReLU Nets That Fixes Their Asymptotic Overconfidence

    Authors: Agustinus Kristiadi, Matthias Hein, Philipp Hennig

    Abstract: A Bayesian treatment can mitigate overconfidence in ReLU nets around the training data. But far away from them, ReLU Bayesian neural networks (BNNs) can still underestimate uncertainty and thus be asymptotically overconfident. This issue arises since the output variance of a BNN with finitely many features is quadratic in the distance from the data region. Meanwhile, Bayesian linear models with Re… ▽ More

    Submitted 24 January, 2022; v1 submitted 6 October, 2020; originally announced October 2020.

    Comments: NeurIPS 2021

  40. arXiv:2007.08473  [pdf, other

    cs.LG cs.CV stat.ML

    Certifiably Adversarially Robust Detection of Out-of-Distribution Data

    Authors: Julian Bitterwolf, Alexander Meinke, Matthias Hein

    Abstract: Deep neural networks are known to be overconfident when applied to out-of-distribution (OOD) inputs which clearly do not belong to any class. This is a problem in safety-critical applications since a reliable assessment of the uncertainty of a classifier is a key property, allowing the system to trigger human intervention or to transfer into a safe state. In this paper, we aim for certifiable wors… ▽ More

    Submitted 10 March, 2021; v1 submitted 16 July, 2020; originally announced July 2020.

    Comments: Published and presented at NeurIPS 2020. Code available at https://gitlab.com/Bitterwolf/GOOD v3: added missing acknowledgement

    Journal ref: Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

  41. arXiv:2006.13977  [pdf, other

    cs.LG cs.AR cs.CR cs.CV stat.ML

    Bit Error Robustness for Energy-Efficient DNN Accelerators

    Authors: David Stutz, Nandhini Chandramoorthy, Matthias Hein, Bernt Schiele

    Abstract: Deep neural network (DNN) accelerators received considerable attention in past years due to saved energy compared to mainstream hardware. Low-voltage operation of DNN accelerators allows to further reduce energy consumption significantly, however, causes bit-level failures in the memory storing the quantized DNN weights. In this paper, we show that a combination of robust fixed-point quantization,… ▽ More

    Submitted 9 April, 2021; v1 submitted 24 June, 2020; originally announced June 2020.

  42. arXiv:2006.12834  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Sparse-RS: a versatile framework for query-efficient sparse black-box adversarial attacks

    Authors: Francesco Croce, Maksym Andriushchenko, Naman D. Singh, Nicolas Flammarion, Matthias Hein

    Abstract: We propose a versatile framework based on random search, Sparse-RS, for score-based sparse targeted and untargeted attacks in the black-box setting. Sparse-RS does not rely on substitute models and achieves state-of-the-art success rate and query efficiency for multiple sparse attack models: $l_0$-bounded perturbations, adversarial patches, and adversarial frames. The $l_0$-version of untargeted S… ▽ More

    Submitted 7 February, 2022; v1 submitted 23 June, 2020; originally announced June 2020.

    Comments: Accepted at AAAI 2022. This version contains considerably extended results in the L0 threat model

  43. arXiv:2003.09461  [pdf, other

    cs.LG cs.CV stat.ML

    Adversarial Robustness on In- and Out-Distribution Improves Explainability

    Authors: Maximilian Augustin, Alexander Meinke, Matthias Hein

    Abstract: Neural networks have led to major improvements in image classification but suffer from being non-robust to adversarial changes, unreliable uncertainty estimates on out-distribution samples and their inscrutable black-box decisions. In this work we propose RATIO, a training procedure for Robustness via Adversarial Training on In- and Out-distribution, which leads to robust models with reliable and… ▽ More

    Submitted 29 July, 2020; v1 submitted 20 March, 2020; originally announced March 2020.

  44. arXiv:2003.01690  [pdf, other

    cs.LG cs.CV stat.ML

    Reliable evaluation of adversarial robustness with an ensemble of diverse parameter-free attacks

    Authors: Francesco Croce, Matthias Hein

    Abstract: The field of defense strategies against adversarial attacks has significantly grown over the last years, but progress is hampered as the evaluation of adversarial defenses is often insufficient and thus gives a wrong impression of robustness. Many promising defenses could be broken later on, making it difficult to identify the state-of-the-art. Frequent pitfalls in the evaluation are improper tuni… ▽ More

    Submitted 4 August, 2020; v1 submitted 3 March, 2020; originally announced March 2020.

    Comments: In ICML 2020

  45. arXiv:2002.10118  [pdf, other

    stat.ML cs.LG

    Being Bayesian, Even Just a Bit, Fixes Overconfidence in ReLU Networks

    Authors: Agustinus Kristiadi, Matthias Hein, Philipp Hennig

    Abstract: The point estimates of ReLU classification networks---arguably the most widely used neural network architecture---have been shown to yield arbitrarily high confidence far away from the training data. This architecture, in conjunction with a maximum a posteriori estimation scheme, is thus not calibrated nor robust. Approximate Bayesian inference has been empirically demonstrated to improve predicti… ▽ More

    Submitted 17 July, 2020; v1 submitted 24 February, 2020; originally announced February 2020.

    Comments: ICML 2020

  46. arXiv:2002.02447  [pdf, other

    math.NA cs.CC math.OC

    Computing the norm of nonnegative matrices and the log-Sobolev constant of Markov chains

    Authors: Antoine Gautier, Matthias Hein, Francesco Tudisco

    Abstract: We analyze the global convergence of the power iterates for the computation of a general mixed-subordinate matrix norm. We prove a new global convergence theorem for a class of entrywise nonnegative matrices that generalizes and improves a well-known results for mixed-subordinate $\ell^p$ matrix norms. In particular, exploiting the Birkoff--Hopf contraction ratio of nonnegative matrices, we obtain… ▽ More

    Submitted 6 February, 2020; originally announced February 2020.

    MSC Class: 65F35; 15B48; 60J10; 47H09; 47H10

  47. arXiv:1912.00049  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Square Attack: a query-efficient black-box adversarial attack via random search

    Authors: Maksym Andriushchenko, Francesco Croce, Nicolas Flammarion, Matthias Hein

    Abstract: We propose the Square Attack, a score-based black-box $l_2$- and $l_\infty$-adversarial attack that does not rely on local gradient information and thus is not affected by gradient masking. Square Attack is based on a randomized search scheme which selects localized square-shaped updates at random positions so that at each iteration the perturbation is situated approximately at the boundary of the… ▽ More

    Submitted 29 July, 2020; v1 submitted 29 November, 2019; originally announced December 2019.

    Comments: Accepted at ECCV 2020; added imperceptible perturbations, analysis of examples that require more queries, results on dilated CNNs

  48. arXiv:1910.13951  [pdf, other

    cs.LG math.NA stat.ML

    Generalized Matrix Means for Semi-Supervised Learning with Multilayer Graphs

    Authors: Pedro Mercado, Francesco Tudisco, Matthias Hein

    Abstract: We study the task of semi-supervised learning on multilayer graphs by taking into account both labeled and unlabeled observations together with the information encoded by each individual graph layer. We propose a regularizer based on the generalized matrix mean, which is a one-parameter family of matrix means that includes the arithmetic, geometric and harmonic means as particular cases. We analyz… ▽ More

    Submitted 30 October, 2019; originally announced October 2019.

    Comments: Accepted in NeurIPS 2019

  49. arXiv:1910.06259  [pdf, other

    cs.LG cs.CR cs.CV stat.ML

    Confidence-Calibrated Adversarial Training: Generalizing to Unseen Attacks

    Authors: David Stutz, Matthias Hein, Bernt Schiele

    Abstract: Adversarial training yields robust models against a specific threat model, e.g., $L_\infty$ adversarial examples. Typically robustness does not generalize to previously unseen threat models, e.g., other $L_p$ norms, or larger perturbations. Our confidence-calibrated adversarial training (CCAT) tackles this problem by biasing the model towards low confidence predictions on adversarial examples. By… ▽ More

    Submitted 30 June, 2020; v1 submitted 14 October, 2019; originally announced October 2019.

  50. arXiv:1909.12180  [pdf, ps, other

    cs.LG cs.CV stat.ML

    Towards neural networks that provably know when they don't know

    Authors: Alexander Meinke, Matthias Hein

    Abstract: It has recently been shown that ReLU networks produce arbitrarily over-confident predictions far away from the training data. Thus, ReLU networks do not know when they don't know. However, this is a highly important property in safety critical applications. In the context of out-of-distribution detection (OOD) there have been a number of proposals to mitigate this problem but none of them are able… ▽ More

    Submitted 21 February, 2020; v1 submitted 26 September, 2019; originally announced September 2019.