Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–14 of 14 results for author: Sharma, Y

Searching in archive stat. Search in all archives.
.
  1. arXiv:2401.04890  [pdf, other

    stat.ML cs.LG

    Nonparametric Partial Disentanglement via Mechanism Sparsity: Sparse Actions, Interventions and Sparse Temporal Dependencies

    Authors: Sébastien Lachapelle, Pau Rodríguez López, Yash Sharma, Katie Everett, Rémi Le Priol, Alexandre Lacoste, Simon Lacoste-Julien

    Abstract: This work introduces a novel principle for disentanglement we call mechanism sparsity regularization, which applies when the latent factors of interest depend sparsely on observed auxiliary variables and/or past latent factors. We propose a representation learning method that induces disentanglement by simultaneously learning the latent factors and the sparse causal graphical model that explains t… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

    Comments: 88 pages

    ACM Class: I.2.6; I.5.1

  2. arXiv:2208.03835  [pdf, other

    cs.LG cs.AI stat.ML

    On Transfer of Adversarial Robustness from Pretraining to Downstream Tasks

    Authors: Laura Fee Nern, Harsh Raj, Maurice Georgi, Yash Sharma

    Abstract: As large-scale training regimes have gained popularity, the use of pretrained models for downstream tasks has become common practice in machine learning. While pretraining has been shown to enhance the performance of models in practice, the transfer of robustness properties from pretraining to downstream tasks remains poorly understood. In this study, we demonstrate that the robustness of a linear… ▽ More

    Submitted 9 October, 2023; v1 submitted 7 August, 2022; originally announced August 2022.

  3. arXiv:2107.10098  [pdf, other

    stat.ML cs.LG

    Disentanglement via Mechanism Sparsity Regularization: A New Principle for Nonlinear ICA

    Authors: Sébastien Lachapelle, Pau Rodríguez López, Yash Sharma, Katie Everett, Rémi Le Priol, Alexandre Lacoste, Simon Lacoste-Julien

    Abstract: This work introduces a novel principle we call disentanglement via mechanism sparsity regularization, which can be applied when the latent factors of interest depend sparsely on past latent factors and/or observed auxiliary variables. We propose a representation learning method that induces disentanglement by simultaneously learning the latent factors and the sparse causal graphical model that rel… ▽ More

    Submitted 23 February, 2022; v1 submitted 21 July, 2021; originally announced July 2021.

    Comments: Appears in: 1st Conference on Causal Learning and Reasoning (CLeaR 2022). 57 pages

    ACM Class: I.2.6; I.5.1

  4. arXiv:2106.04619  [pdf, other

    stat.ML cs.AI cs.CV cs.LG

    Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style

    Authors: Julius von Kügelgen, Yash Sharma, Luigi Gresele, Wieland Brendel, Bernhard Schölkopf, Michel Besserve, Francesco Locatello

    Abstract: Self-supervised representation learning has shown remarkable success in a number of domains. A common practice is to perform data augmentation via hand-crafted transformations intended to leave the semantics of the data invariant. We seek to understand the empirical success of this approach from a theoretical perspective. We formulate the augmentation process as a latent variable model by postulat… ▽ More

    Submitted 14 January, 2022; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2021 final camera-ready revision (with minor corrections)

  5. arXiv:2007.10930  [pdf, other

    stat.ML cs.CV cs.LG

    Towards Nonlinear Disentanglement in Natural Data with Temporal Sparse Coding

    Authors: David Klindt, Lukas Schott, Yash Sharma, Ivan Ustyuzhaninov, Wieland Brendel, Matthias Bethge, Dylan Paiton

    Abstract: We construct an unsupervised learning model that achieves nonlinear disentanglement of underlying factors of variation in naturalistic videos. Previous work suggests that representations can be disentangled if all but a few factors in the environment stay constant at any point in time. As a result, algorithms proposed for this problem have only been tested on carefully constructed datasets with th… ▽ More

    Submitted 17 March, 2021; v1 submitted 21 July, 2020; originally announced July 2020.

    Comments: ICLR 2021. Code is available at https://github.com/bethgelab/slow_disentanglement. The first three authors, as well as the last two authors, contributed equally

  6. arXiv:2007.06533  [pdf, other

    cs.LG stat.ML

    S2RMs: Spatially Structured Recurrent Modules

    Authors: Nasim Rahaman, Anirudh Goyal, Muhammad Waleed Gondal, Manuel Wuthrich, Stefan Bauer, Yash Sharma, Yoshua Bengio, Bernhard Schölkopf

    Abstract: Capturing the structure of a data-generating process by means of appropriate inductive biases can help in learning models that generalize well and are robust to changes in the input distribution. While methods that harness spatial and temporal structures find broad application, recent work has demonstrated the potential of models that leverage sparse and modular structure using an ensemble of spar… ▽ More

    Submitted 13 July, 2020; originally announced July 2020.

  7. arXiv:1903.00073  [pdf, other

    cs.CV cs.CR cs.LG stat.ML

    On the Effectiveness of Low Frequency Perturbations

    Authors: Yash Sharma, Gavin Weiguang Ding, Marcus Brubaker

    Abstract: Carefully crafted, often imperceptible, adversarial perturbations have been shown to cause state-of-the-art models to yield extremely inaccurate outputs, rendering them unsuitable for safety-critical application domains. In addition, recent work has shown that constraining the attack space to a low frequency regime is particularly effective. Yet, it remains unclear whether this is due to generally… ▽ More

    Submitted 31 May, 2019; v1 submitted 28 February, 2019; originally announced March 2019.

    Comments: IJCAI 2019

  8. arXiv:1812.02637  [pdf, other

    cs.LG cs.NE stat.ML

    MMA Training: Direct Input Space Margin Maximization through Adversarial Training

    Authors: Gavin Weiguang Ding, Yash Sharma, Kry Yik Chau Lui, Ruitong Huang

    Abstract: We study adversarial robustness of neural networks from a margin maximization perspective, where margins are defined as the distances from inputs to a classifier's decision boundary. Our study shows that maximizing margins can be achieved by minimizing the adversarial loss on the decision boundary at the "shortest successful perturbation", demonstrating a close connection between adversarial losse… ▽ More

    Submitted 4 March, 2020; v1 submitted 6 December, 2018; originally announced December 2018.

    Comments: Published at the Eighth International Conference on Learning Representations (ICLR 2020), https://openreview.net/forum?id=HkeryxBtPB

  9. arXiv:1803.09868  [pdf, other

    stat.ML cs.LG

    Bypassing Feature Squeezing by Increasing Adversary Strength

    Authors: Yash Sharma, Pin-Yu Chen

    Abstract: Feature Squeezing is a recently proposed defense method which reduces the search space available to an adversary by coalescing samples that correspond to many different feature vectors in the original space into a single sample. It has been shown that feature squeezing defenses can be combined in a joint detection framework to achieve high detection rates against state-of-the-art attacks. However,… ▽ More

    Submitted 26 March, 2018; originally announced March 2018.

  10. arXiv:1802.06552  [pdf, other

    cs.LG stat.ML

    Are Generative Classifiers More Robust to Adversarial Attacks?

    Authors: Yingzhen Li, John Bradshaw, Yash Sharma

    Abstract: There is a rising interest in studying the robustness of deep neural network classifiers against adversaries, with both advanced attack and defence techniques being actively developed. However, most recent work focuses on discriminative classifiers, which only model the conditional distribution of the labels given the inputs. In this paper, we propose and investigate the deep Bayes classifier, whi… ▽ More

    Submitted 27 May, 2019; v1 submitted 19 February, 2018; originally announced February 2018.

    Comments: ICML 2019

  11. arXiv:1710.10733  [pdf, other

    stat.ML cs.CR cs.LG

    Attacking the Madry Defense Model with $L_1$-based Adversarial Examples

    Authors: Yash Sharma, Pin-Yu Chen

    Abstract: The Madry Lab recently hosted a competition designed to test the robustness of their adversarially trained MNIST model. Attacks were constrained to perturb each pixel of the input image by a scaled maximal $L_\infty$ distortion $ε$ = 0.3. This discourages the use of attacks which are not optimized on the $L_\infty$ distortion metric. Our experimental results demonstrate that by relaxing the… ▽ More

    Submitted 27 July, 2018; v1 submitted 29 October, 2017; originally announced October 2017.

    Comments: Accepted to ICLR 2018 Workshops

  12. arXiv:1709.04114  [pdf, other

    stat.ML cs.CR cs.LG

    EAD: Elastic-Net Attacks to Deep Neural Networks via Adversarial Examples

    Authors: Pin-Yu Chen, Yash Sharma, Huan Zhang, Jinfeng Yi, Cho-Jui Hsieh

    Abstract: Recent studies have highlighted the vulnerability of deep neural networks (DNNs) to adversarial examples - a visually indistinguishable adversarial image can easily be crafted to cause a well-trained model to misclassify. Existing methods for crafting adversarial examples are based on $L_2$ and $L_\infty$ distortion metrics. However, despite the fact that $L_1$ distortion accounts for the total va… ▽ More

    Submitted 9 February, 2018; v1 submitted 12 September, 2017; originally announced September 2017.

    Comments: To be published at AAAI 2018

  13. arXiv:1708.03999  [pdf, other

    stat.ML cs.CR cs.LG

    ZOO: Zeroth Order Optimization based Black-box Attacks to Deep Neural Networks without Training Substitute Models

    Authors: Pin-Yu Chen, Huan Zhang, Yash Sharma, Jinfeng Yi, Cho-Jui Hsieh

    Abstract: Deep neural networks (DNNs) are one of the most prominent technologies of our time, as they achieve state-of-the-art performance in many machine learning tasks, including but not limited to image classification, text mining, and speech processing. However, recent research on DNNs has indicated ever-increasing concern on the robustness to adversarial examples, especially for security-critical tasks… ▽ More

    Submitted 2 November, 2017; v1 submitted 13 August, 2017; originally announced August 2017.

    Comments: Accepted by 10th ACM Workshop on Artificial Intelligence and Security (AISEC) with the 24th ACM Conference on Computer and Communications Security (CCS)

  14. arXiv:1610.00768  [pdf, ps, other

    cs.LG cs.CR stat.ML

    Technical Report on the CleverHans v2.1.0 Adversarial Examples Library

    Authors: Nicolas Papernot, Fartash Faghri, Nicholas Carlini, Ian Goodfellow, Reuben Feinman, Alexey Kurakin, Cihang Xie, Yash Sharma, Tom Brown, Aurko Roy, Alexander Matyasko, Vahid Behzadan, Karen Hambardzumyan, Zhishuai Zhang, Yi-Lin Juang, Zhi Li, Ryan Sheatsley, Abhibhav Garg, Jonathan Uesato, Willi Gierke, Yinpeng Dong, David Berthelot, Paul Hendricks, Jonas Rauber, Rujun Long , et al. (1 additional authors not shown)

    Abstract: CleverHans is a software library that provides standardized reference implementations of adversarial example construction techniques and adversarial training. The library may be used to develop more robust machine learning models and to provide standardized benchmarks of models' performance in the adversarial setting. Benchmarks constructed without a standardized implementation of adversarial exam… ▽ More

    Submitted 27 June, 2018; v1 submitted 3 October, 2016; originally announced October 2016.

    Comments: Technical report for https://github.com/tensorflow/cleverhans