Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–3 of 3 results for author: Caro, J O

Searching in archive stat. Search in all archives.
.
  1. arXiv:2302.03750  [pdf, other

    cs.CV cs.LG stat.ME

    Linking convolutional kernel size to generalization bias in face analysis CNNs

    Authors: Hao Liang, Josue Ortega Caro, Vikram Maheshri, Ankit B. Patel, Guha Balakrishnan

    Abstract: Training dataset biases are by far the most scrutinized factors when explaining algorithmic biases of neural networks. In contrast, hyperparameters related to the neural network architecture have largely been ignored even though different network parameterizations are known to induce different implicit biases over learned features. For example, convolutional kernel size is known to affect the freq… ▽ More

    Submitted 3 December, 2023; v1 submitted 7 February, 2023; originally announced February 2023.

    Comments: WACV 2024

  2. arXiv:2008.01772  [pdf

    cs.LG stat.ML

    Shallow Univariate ReLu Networks as Splines: Initialization, Loss Surface, Hessian, & Gradient Flow Dynamics

    Authors: Justin Sahs, Ryan Pyle, Aneel Damaraju, Josue Ortega Caro, Onur Tavaslioglu, Andy Lu, Ankit Patel

    Abstract: Understanding the learning dynamics and inductive bias of neural networks (NNs) is hindered by the opacity of the relationship between NN parameters and the function represented. We propose reparametrizing ReLU NNs as continuous piecewise linear splines. Using this spline lens, we study learning dynamics in shallow univariate ReLU NNs, finding unexpected insights and explanations for several perpl… ▽ More

    Submitted 4 August, 2020; originally announced August 2020.

    Comments: 14 pages, 4 figures in main text

    ACM Class: I.2.0

  3. arXiv:2006.11440  [pdf, other

    stat.ML cs.LG

    Local Convolutions Cause an Implicit Bias towards High Frequency Adversarial Examples

    Authors: Josue Ortega Caro, Yilong Ju, Ryan Pyle, Sourav Dey, Wieland Brendel, Fabio Anselmi, Ankit Patel

    Abstract: Adversarial Attacks are still a significant challenge for neural networks. Recent work has shown that adversarial perturbations typically contain high-frequency features, but the root cause of this phenomenon remains unknown. Inspired by theoretical work on linear full-width convolutional models, we hypothesize that the local (i.e. bounded-width) convolutional operations commonly used in current n… ▽ More

    Submitted 8 March, 2023; v1 submitted 19 June, 2020; originally announced June 2020.

    Comments: 23 pages, 11 figures, 12 Tables