Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–28 of 28 results for author: Vincent, P

Searching in archive stat. Search in all archives.
.
  1. arXiv:2311.08815  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Self-Supervised Disentanglement by Leveraging Structure in Data Augmentations

    Authors: Cian Eastwood, Julius von Kügelgen, Linus Ericsson, Diane Bouchacourt, Pascal Vincent, Bernhard Schölkopf, Mark Ibrahim

    Abstract: Self-supervised representation learning often uses data augmentations to induce some invariance to "style" attributes of the data. However, with downstream tasks generally unknown at training time, it is difficult to deduce a priori which attributes of the data are indeed "style" and can be safely discarded. To deal with this, current approaches try to retain some style information by tuning the d… ▽ More

    Submitted 20 August, 2024; v1 submitted 15 November, 2023; originally announced November 2023.

  2. arXiv:2309.16748  [pdf, other

    cs.LG cs.AI stat.ML

    Discovering environments with XRM

    Authors: Mohammad Pezeshki, Diane Bouchacourt, Mark Ibrahim, Nicolas Ballas, Pascal Vincent, David Lopez-Paz

    Abstract: Environment annotations are essential for the success of many out-of-distribution (OOD) generalization methods. Unfortunately, these are costly to obtain and often limited by human annotators' biases. To achieve robust generalization, it is essential to develop algorithms for automatic environment discovery within datasets. Current proposals, which divide examples based on their training error, su… ▽ More

    Submitted 19 July, 2024; v1 submitted 28 September, 2023; originally announced September 2023.

    Comments: Oral at ICML 2024

  3. arXiv:2210.07347  [pdf, other

    cs.LG stat.ML

    Disentanglement of Correlated Factors via Hausdorff Factorized Support

    Authors: Karsten Roth, Mark Ibrahim, Zeynep Akata, Pascal Vincent, Diane Bouchacourt

    Abstract: A grand goal in deep learning research is to learn representations capable of generalizing across distribution shifts. Disentanglement is one promising direction aimed at aligning a model's representation with the underlying factors generating the data (e.g. color or background). Existing disentanglement methods, however, rely on an often unrealistic assumption: that factors are statistically inde… ▽ More

    Submitted 25 February, 2023; v1 submitted 13 October, 2022; originally announced October 2022.

    Comments: Accepted to ICLR 2023

  4. arXiv:2103.03098  [pdf, other

    cs.LG stat.ML

    Accounting for Variance in Machine Learning Benchmarks

    Authors: Xavier Bouthillier, Pierre Delaunay, Mirko Bronzi, Assya Trofimov, Brennan Nichyporuk, Justin Szeto, Naz Sepah, Edward Raff, Kanika Madan, Vikram Voleti, Samira Ebrahimi Kahou, Vincent Michalski, Dmitriy Serdyuk, Tal Arbel, Chris Pal, Gaël Varoquaux, Pascal Vincent

    Abstract: Strong empirical evidence that one machine-learning algorithm A outperforms another one B ideally calls for multiple trials optimizing the learning pipeline over sources of variation such as data sampling, data augmentation, parameter initialization, and hyperparameters choices. This is prohibitively expensive, and corners are cut to reach conclusions. We model the whole benchmarking process, reve… ▽ More

    Submitted 1 March, 2021; originally announced March 2021.

    Comments: Submitted to MLSys2021

  5. arXiv:2010.12870  [pdf, ps, other

    cs.LG stat.ML

    Efficient Learning in Non-Stationary Linear Markov Decision Processes

    Authors: Ahmed Touati, Pascal Vincent

    Abstract: We study episodic reinforcement learning in non-stationary linear (a.k.a. low-rank) Markov Decision Processes (MDPs), i.e, both the reward and transition kernel are linear with respect to a given feature map and are allowed to evolve either slowly or abruptly over time. For this problem setting, we propose OPT-WLSVI an optimistic model-free algorithm based on weighted least squares value iteration… ▽ More

    Submitted 27 December, 2021; v1 submitted 24 October, 2020; originally announced October 2020.

  6. arXiv:2008.00938  [pdf, other

    cs.LG stat.ML

    Implicit Regularization via Neural Feature Alignment

    Authors: Aristide Baratin, Thomas George, César Laurent, R Devon Hjelm, Guillaume Lajoie, Pascal Vincent, Simon Lacoste-Julien

    Abstract: We approach the problem of implicit regularization in deep learning from a geometrical viewpoint. We highlight a regularization effect induced by a dynamical alignment of the neural tangent features introduced by Jacot et al, along a small number of task-relevant directions. This can be interpreted as a combined mechanism of feature selection and compression. By extrapolating a new analysis of Rad… ▽ More

    Submitted 16 March, 2021; v1 submitted 3 August, 2020; originally announced August 2020.

    Comments: AISTATS 2021

  7. arXiv:2007.04202  [pdf, other

    cs.LG cs.GT math.OC stat.ML

    Stochastic Hamiltonian Gradient Methods for Smooth Games

    Authors: Nicolas Loizou, Hugo Berard, Alexia Jolicoeur-Martineau, Pascal Vincent, Simon Lacoste-Julien, Ioannis Mitliagkas

    Abstract: The success of adversarial formulations in machine learning has brought renewed motivation for smooth games. In this work, we focus on the class of stochastic Hamiltonian methods and provide the first convergence guarantees for certain classes of stochastic smooth games. We propose a novel unbiased estimator for the stochastic Hamiltonian gradient descent (SHGD) and highlight its benefits. Using t… ▽ More

    Submitted 8 July, 2020; originally announced July 2020.

    Comments: ICML 2020 - Proceedings of the 37th International Conference on Machine Learning

  8. arXiv:2007.03749  [pdf, ps, other

    cs.LG stat.ML

    Sharp Analysis of Smoothed Bellman Error Embedding

    Authors: Ahmed Touati, Pascal Vincent

    Abstract: The \textit{Smoothed Bellman Error Embedding} algorithm~\citep{dai2018sbeed}, known as SBEED, was proposed as a provably convergent reinforcement learning algorithm with general nonlinear function approximation. It has been successfully implemented with neural networks and achieved strong empirical results. In this work, we study the theoretical behavior of SBEED in batch-mode reinforcement learni… ▽ More

    Submitted 7 July, 2020; originally announced July 2020.

    Comments: Accepted at the ICML 2020 Workshop on Theoretical Foundations of Reinforcement Learning

  9. arXiv:2007.00720  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    Adversarial Example Games

    Authors: Avishek Joey Bose, Gauthier Gidel, Hugo Berard, Andre Cianflone, Pascal Vincent, Simon Lacoste-Julien, William L. Hamilton

    Abstract: The existence of adversarial examples capable of fooling trained neural network classifiers calls for a much better understanding of possible attacks to guide the development of safeguards against them. This includes attack methods in the challenging non-interactive blackbox setting, where adversarial attacks are generated without any access, including queries, to the target model. Prior attacks i… ▽ More

    Submitted 8 January, 2021; v1 submitted 1 July, 2020; originally announced July 2020.

    Comments: Appears in: Advances in Neural Information Processing Systems 33 (NeurIPS 2020)

  10. arXiv:2006.12279  [pdf, other

    cs.LG stat.ML

    Revisiting Loss Modelling for Unstructured Pruning

    Authors: César Laurent, Camille Ballas, Thomas George, Nicolas Ballas, Pascal Vincent

    Abstract: By removing parameters from deep neural networks, unstructured pruning methods aim at cutting down memory footprint and computational cost, while maintaining prediction accuracy. In order to tackle this otherwise intractable problem, many of these methods model the loss landscape using first or second order Taylor expansions to identify which parameters can be discarded. We revisit loss modelling… ▽ More

    Submitted 22 June, 2020; originally announced June 2020.

  11. arXiv:2003.04108  [pdf, other

    cs.LG stat.ML

    Stable Policy Optimization via Off-Policy Divergence Regularization

    Authors: Ahmed Touati, Amy Zhang, Joelle Pineau, Pascal Vincent

    Abstract: Trust Region Policy Optimization (TRPO) and Proximal Policy Optimization (PPO) are among the most successful policy gradient approaches in deep reinforcement learning (RL). While these methods achieve state-of-the-art performance across a wide range of challenging tasks, there is room for improvement in the stabilization of the policy learning and how the off-policy data are used. In this paper we… ▽ More

    Submitted 19 June, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Journal ref: Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence (UAI), PMLR volume 124, 2020

  12. arXiv:1906.04848  [pdf, other

    cs.LG stat.ML

    A Closer Look at the Optimization Landscapes of Generative Adversarial Networks

    Authors: Hugo Berard, Gauthier Gidel, Amjad Almahairi, Pascal Vincent, Simon Lacoste-Julien

    Abstract: Generative adversarial networks have been very successful in generative modeling, however they remain relatively challenging to train compared to standard deep neural networks. In this paper, we propose new visualization techniques for the optimization landscapes of GANs that enable us to study the game vector field resulting from the concatenation of the gradient of both players. Using these visu… ▽ More

    Submitted 27 April, 2020; v1 submitted 11 June, 2019; originally announced June 2019.

  13. arXiv:1906.04282  [pdf, other

    cs.LG stat.ML

    Stochastic Neural Network with Kronecker Flow

    Authors: Chin-Wei Huang, Ahmed Touati, Pascal Vincent, Gintare Karolina Dziugaite, Alexandre Lacoste, Aaron Courville

    Abstract: Recent advances in variational inference enable the modelling of highly structured joint distributions, but are limited in their capacity to scale to the high-dimensional setting of stochastic neural networks. This limitation motivates a need for scalable parameterizations of the noise generation process, in a manner that adequately captures the dependencies among the various parameters. In this w… ▽ More

    Submitted 13 February, 2020; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: Proceedings of the 23rdInternational Conference on ArtificialIntelligence and Statistics (AISTATS) 2020

  14. arXiv:1906.03704  [pdf, other

    cs.LG stat.ML

    SVRG for Policy Evaluation with Fewer Gradient Evaluations

    Authors: Zilun Peng, Ahmed Touati, Pascal Vincent, Doina Precup

    Abstract: Stochastic variance-reduced gradient (SVRG) is an optimization method originally designed for tackling machine learning problems with a finite sum structure. SVRG was later shown to work for policy evaluation, a problem in reinforcement learning in which one aims to estimate the value function of a given policy. SVRG makes use of gradient estimates at two scales. At the slower scale, SVRG computes… ▽ More

    Submitted 19 June, 2020; v1 submitted 9 June, 2019; originally announced June 2019.

    Comments: Short version of the paper is published in the proceedings of the 29th International Joint Conference on Artificial Intelligence and the 17th Pacific Rim International Conference on Artificial Intelligence (IJCAI-PRICAI2020)

  15. arXiv:1811.08839  [pdf, other

    cs.CV cs.LG eess.SP physics.med-ph stat.ML

    fastMRI: An Open Dataset and Benchmarks for Accelerated MRI

    Authors: Jure Zbontar, Florian Knoll, Anuroop Sriram, Tullie Murrell, Zhengnan Huang, Matthew J. Muckley, Aaron Defazio, Ruben Stern, Patricia Johnson, Mary Bruno, Marc Parente, Krzysztof J. Geras, Joe Katsnelson, Hersh Chandarana, Zizhao Zhang, Michal Drozdzal, Adriana Romero, Michael Rabbat, Pascal Vincent, Nafissa Yakubova, James Pinkerton, Duo Wang, Erich Owens, C. Lawrence Zitnick, Michael P. Recht , et al. (2 additional authors not shown)

    Abstract: Accelerating Magnetic Resonance Imaging (MRI) by taking fewer measurements has the potential to reduce medical costs, minimize stress to patients and make MRI possible in applications where it is currently prohibitively slow or expensive. We introduce the fastMRI dataset, a large-scale collection of both raw MR measurements and clinical MR images, that can be used for training and evaluation of ma… ▽ More

    Submitted 11 December, 2019; v1 submitted 21 November, 2018; originally announced November 2018.

    Comments: 35 pages, 10 figures

  16. arXiv:1806.03884  [pdf, other

    cs.LG stat.ML

    Fast Approximate Natural Gradient Descent in a Kronecker-factored Eigenbasis

    Authors: Thomas George, César Laurent, Xavier Bouthillier, Nicolas Ballas, Pascal Vincent

    Abstract: Optimization algorithms that leverage gradient covariance information, such as variants of natural gradient descent (Amari, 1998), offer the prospect of yielding more effective descent directions. For models with many parameters, the covariance matrix they are based on becomes gigantic, making them inapplicable in their original form. This has motivated research into both simple diagonal approxima… ▽ More

    Submitted 26 July, 2021; v1 submitted 11 June, 2018; originally announced June 2018.

    Journal ref: Advances in Neural Information Processing Systems 2018

  17. arXiv:1806.02315  [pdf, other

    cs.LG stat.ML

    Randomized Value Functions via Multiplicative Normalizing Flows

    Authors: Ahmed Touati, Harsh Satija, Joshua Romoff, Joelle Pineau, Pascal Vincent

    Abstract: Randomized value functions offer a promising approach towards the challenge of efficient exploration in complex environments with high dimensional state and action spaces. Unlike traditional point estimate methods, randomized value functions maintain a posterior distribution over action-space values. This prevents the agent's behavior policy from prematurely exploiting early estimates and falling… ▽ More

    Submitted 28 June, 2019; v1 submitted 6 June, 2018; originally announced June 2018.

    Journal ref: UAI 2019: Conference on Uncertainty in Artificial Intelligence 2019

  18. arXiv:1802.10551  [pdf, other

    cs.LG math.OC stat.ML

    A Variational Inequality Perspective on Generative Adversarial Networks

    Authors: Gauthier Gidel, Hugo Berard, Gaëtan Vignoud, Pascal Vincent, Simon Lacoste-Julien

    Abstract: Generative adversarial networks (GANs) form a generative modeling approach known for producing appealing samples, but they are notably difficult to train. One common way to tackle this issue has been to propose new formulations of the GAN objective. Yet, surprisingly few studies have looked at optimization methods designed for this adversarial training. In this work, we cast GAN optimization probl… ▽ More

    Submitted 28 August, 2020; v1 submitted 28 February, 2018; originally announced February 2018.

    Comments: Appears in: Proceedings of the Seventh International Conference on Learning Representations (ICLR 2019). Minor modifications with respect to the ICLR version (First paragraph of page 2 and section 3.3): New reference [Popov 1980] and discussion with regards to the novelty of extrapolation from the past. 38 pages

    ACM Class: I.2.6; G.1.6

  19. arXiv:1708.02511  [pdf, other

    cs.LG stat.ML

    Parametric Adversarial Divergences are Good Losses for Generative Modeling

    Authors: Gabriel Huang, Hugo Berard, Ahmed Touati, Gauthier Gidel, Pascal Vincent, Simon Lacoste-Julien

    Abstract: Parametric adversarial divergences, which are a generalization of the losses used to train generative adversarial networks (GANs), have often been described as being approximations of their nonparametric counterparts, such as the Jensen-Shannon divergence, which can be derived under the so-called optimal discriminator assumption. In this position paper, we argue that despite being "non-optimal", p… ▽ More

    Submitted 21 October, 2021; v1 submitted 8 August, 2017; originally announced August 2017.

  20. arXiv:1703.06975  [pdf, other

    stat.ML cs.LG

    Learning to Generate Samples from Noise through Infusion Training

    Authors: Florian Bordes, Sina Honari, Pascal Vincent

    Abstract: In this work, we investigate a novel training procedure to learn a generative model as the transition operator of a Markov chain, such that, when applied repeatedly on an unstructured random noise sample, it will denoise it into a sample that matches the target distribution from the training set. The novel training procedure to learn this progressive denoising operation involves sampling from a sl… ▽ More

    Submitted 20 March, 2017; originally announced March 2017.

    Comments: Published as a conference paper at ICLR 2017

  21. arXiv:1609.05866  [pdf, other

    cs.LG cs.IR cs.NE stat.ML

    A Cheap Linear Attention Mechanism with Fast Lookups and Fixed-Size Representations

    Authors: Alexandre de Brébisson, Pascal Vincent

    Abstract: The softmax content-based attention mechanism has proven to be very beneficial in many applications of recurrent neural networks. Nevertheless it suffers from two major computational limitations. First, its computations for an attention lookup scale linearly in the size of the attended sequence. Second, it does not encode the sequence into a fixed-size representation but instead requires to memori… ▽ More

    Submitted 19 September, 2016; originally announced September 2016.

  22. arXiv:1605.07427  [pdf, other

    stat.ML cs.CL cs.LG cs.NE

    Hierarchical Memory Networks

    Authors: Sarath Chandar, Sungjin Ahn, Hugo Larochelle, Pascal Vincent, Gerald Tesauro, Yoshua Bengio

    Abstract: Memory networks are neural networks with an explicit memory component that can be both read and written to by the network. The memory is often addressed in a soft way using a softmax function, making end-to-end training with backpropagation possible. However, this is not computationally scalable for applications which require the network to read from extremely large memories. On the other hand, it… ▽ More

    Submitted 24 May, 2016; originally announced May 2016.

    Comments: 10 pages

  23. arXiv:1604.08859  [pdf, other

    cs.LG cs.AI stat.ML

    The Z-loss: a shift and scale invariant classification loss belonging to the Spherical Family

    Authors: Alexandre de Brébisson, Pascal Vincent

    Abstract: Despite being the standard loss function to train multi-class neural networks, the log-softmax has two potential limitations. First, it involves computations that scale linearly with the number of output classes, which can restrict the size of problems we are able to tackle with current hardware. Second, it remains unclear how close it matches the task loss such as the top-k error rate or other no… ▽ More

    Submitted 27 May, 2016; v1 submitted 29 April, 2016; originally announced April 2016.

  24. arXiv:1511.05042  [pdf, ps, other

    cs.NE cs.LG stat.ML

    An Exploration of Softmax Alternatives Belonging to the Spherical Loss Family

    Authors: Alexandre de Brébisson, Pascal Vincent

    Abstract: In a multi-class classification problem, it is standard to model the output of a neural network as a categorical distribution conditioned on the inputs. The output must therefore be positive and sum to one, which is traditionally enforced by a softmax. This probabilistic mapping allows to use the maximum likelihood principle, which leads to the well-known log-softmax loss. However the choice of th… ▽ More

    Submitted 28 February, 2016; v1 submitted 16 November, 2015; originally announced November 2015.

    Comments: Published at ICLR 2016

  25. arXiv:1507.05910  [pdf, other

    cs.LG cs.CL stat.ML

    Clustering is Efficient for Approximate Maximum Inner Product Search

    Authors: Alex Auvolat, Sarath Chandar, Pascal Vincent, Hugo Larochelle, Yoshua Bengio

    Abstract: Efficient Maximum Inner Product Search (MIPS) is an important task that has a wide applicability in recommendation systems and classification with a large number of classes. Solutions based on locality-sensitive hashing (LSH) as well as tree-based solutions have been investigated in the recent literature, to perform approximate MIPS in sublinear time. In this paper, we compare these to another ext… ▽ More

    Submitted 29 November, 2015; v1 submitted 21 July, 2015; originally announced July 2015.

    Comments: 10 pages, Under review at ICLR 2016

  26. arXiv:1506.08700  [pdf, other

    stat.ML cs.LG

    Dropout as data augmentation

    Authors: Xavier Bouthillier, Kishore Konda, Pascal Vincent, Roland Memisevic

    Abstract: Dropout is typically interpreted as bagging a large number of models sharing parameters. We show that using dropout in a network can also be interpreted as a kind of data augmentation in the input space without domain knowledge. We present an approach to projecting the dropout noise within a network back into the input space, thereby generating augmented versions of the training data, and we show… ▽ More

    Submitted 7 January, 2016; v1 submitted 29 June, 2015; originally announced June 2015.

  27. arXiv:1206.6434  [pdf

    cs.LG stat.ML

    A Generative Process for Sampling Contractive Auto-Encoders

    Authors: Salah Rifai, Yoshua Bengio, Yann Dauphin, Pascal Vincent

    Abstract: The contractive auto-encoder learns a representation of the input data that captures the local manifold structure around each data point, through the leading singular vectors of the Jacobian of the transformation from input to representation. The corresponding singular values specify how much local variation is plausible in directions associated with the corresponding singular vectors, while remai… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)

  28. arXiv:1206.6392  [pdf

    cs.LG cs.SD stat.ML

    Modeling Temporal Dependencies in High-Dimensional Sequences: Application to Polyphonic Music Generation and Transcription

    Authors: Nicolas Boulanger-Lewandowski, Yoshua Bengio, Pascal Vincent

    Abstract: We investigate the problem of modeling symbolic sequences of polyphonic music in a completely general piano-roll representation. We introduce a probabilistic model based on distribution estimators conditioned on a recurrent neural network that is able to discover temporal dependencies in high-dimensional sequences. Our approach outperforms many traditional models of polyphonic music on a variety o… ▽ More

    Submitted 27 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the 29th International Conference on Machine Learning (ICML 2012)