Showing 1–10 of 10 results for author: Corff, S L

Searching in archive cs.
  1. arXiv:2404.11117  [pdf, other]

    cs.LG

    Variational quantization for state space models

    Authors: Etienne David, Jean Bellot, Sylvain Le Corff

    Abstract: Forecasting tasks using large datasets gathering thousands of heterogeneous time series is a crucial statistical problem in numerous sectors. The main challenge is to model a rich variety of time series, leverage any available external signals and provide sharp predictions with statistical guarantees. In this work, we propose a new forecasting model that combines discrete state space hidden Markov…

    Submitted 17 April, 2024; originally announced April 2024.

  2. arXiv:2404.07593  [pdf, other]

    stat.ML cs.LG stat.ME

    Diffusion posterior sampling for simulation-based inference in tall data settings

    Authors: Julia Linhart, Gabriel Victorino Cardoso, Alexandre Gramfort, Sylvain Le Corff, Pedro L. C. Rodrigues

    Abstract: Determining which parameters of a non-linear model best describe a set of experimental data is a fundamental problem in science and it has gained much traction lately with the rise of complex large-scale simulators. The likelihood of such models is typically intractable, which is why classical MCMC methods can not be used. Simulation-based inference (SBI) stands out in this context by only requiri…

    Submitted 7 June, 2024; v1 submitted 11 April, 2024; originally announced April 2024.

    Comments: 49 pages, 24 figures, 3 tables, 2 algorithms, 12 appendices, in proceedings

  3. arXiv:2402.02857  [pdf, other]

    stat.ML cs.LG

    Non-asymptotic Analysis of Biased Adaptive Stochastic Approximation

    Authors: Sobihan Surendran, Antoine Godichon-Baggioni, Adeline Fermanian, Sylvain Le Corff

    Abstract: Stochastic Gradient Descent (SGD) with adaptive steps is now widely used for training deep neural networks. Most theoretical results assume access to unbiased gradient estimators, which is not the case in several recent deep learning and reinforcement learning applications that use Monte Carlo methods. This paper provides a comprehensive non-asymptotic analysis of SGD with biased gradients and ada…

    Submitted 5 February, 2024; originally announced February 2024.

  4. arXiv:2308.07983  [pdf, other]

    stat.ML cs.LG stat.ME

    Monte Carlo guided Diffusion for Bayesian linear inverse problems

    Authors: Gabriel Cardoso, Yazid Janati El Idrissi, Sylvain Le Corff, Eric Moulines

    Abstract: Ill-posed linear inverse problems arise frequently in various applications, from computational photography to medical imaging. A recent line of research exploits Bayesian inference with informative priors to handle the ill-posedness of such problems. Amongst such priors, score-based generative models (SGM) have recently been successfully applied to several different inverse problems. In this study…

    Submitted 25 October, 2023; v1 submitted 15 August, 2023; originally announced August 2023.

    Comments: preprint

  5. arXiv:2109.09371  [pdf, other]

    cs.AI cs.CL cs.NE stat.ML

    Learning Natural Language Generation from Scratch

    Authors: Alice Martin Donati, Guillaume Quispe, Charles Ollion, Sylvain Le Corff, Florian Strub, Olivier Pietquin

    Abstract: This paper introduces TRUncated ReinForcement Learning for Language (TrufLL), an original approach to train conditional language models from scratch by only using reinforcement learning (RL). As RL methods unsuccessfully scale to large action spaces, we dynamically truncate the vocabulary space using a generic language model. TrufLL thus enables to train a language agent by solely interacting withi…

    Submitted 20 September, 2021; originally announced September 2021.

  6. arXiv:2106.09620  [pdf, other]

    stat.ML cs.LG

    Disentangling Identifiable Features from Noisy Data with Structured Nonlinear ICA

    Authors: Hermanni Hälvä, Sylvain Le Corff, Luc Lehéricy, Jonathan So, Yongjie Zhu, Elisabeth Gassiat, Aapo Hyvarinen

    Abstract: We introduce a new general identifiable framework for principled disentanglement referred to as Structured Nonlinear Independent Component Analysis (SNICA). Our contribution is to extend the identifiability theory of deep generative models for a very broad class of structured models. While previous works have shown identifiability for specific classes of time-series models, our theorems extend thi…

    Submitted 27 October, 2021; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: Accepted for publication at NeurIPS 2021

  7. arXiv:2105.02814  [pdf, other]

    eess.SP cs.AI

    End-to-end deep meta modelling to calibrate and optimize energy consumption and comfort

    Authors: Max Cohen, Sylvain Le Corff, Maurice Charbit, Marius Preda, Gilles Nozière

    Abstract: In this paper, we propose a new end-to-end methodology to optimize the energy performance as well as comfort and air quality in large buildings without any renovation work. We introduce a metamodel based on recurrent neural networks and trained to predict the behavior of a general class of buildings using a database sampled from a simulation program. This metamodel is then deployed in different fr…

    Submitted 5 November, 2021; v1 submitted 1 February, 2021; originally announced May 2021.

    Comments: arXiv admin note: text overlap with arXiv:2006.12390

    Journal ref: Energy and Buildings, Elsevier, 2021

  8. arXiv:2102.08023  [pdf, other]

    cs.LG eess.IV stat.ML

    Joint self-supervised blind denoising and noise estimation

    Authors: Jean Ollion, Charles Ollion, Elisabeth Gassiat, Luc Lehéricy, Sylvain Le Corff

    Abstract: We propose a novel self-supervised image blind denoising approach in which two neural networks jointly predict the clean signal and infer the noise distribution. Assuming that the noisy observations are independent conditionally to the signal, the networks can be jointly trained without clean training data. Therefore, our approach is particularly relevant for biomedical image denoising where the n…

    Submitted 16 February, 2021; originally announced February 2021.

  9. arXiv:2007.08620  [pdf, other]

    cs.LG cs.AI stat.ML

    The Monte Carlo Transformer: a stochastic self-attention model for sequence prediction

    Authors: Alice Martin, Charles Ollion, Florian Strub, Sylvain Le Corff, Olivier Pietquin

    Abstract: This paper introduces the Sequential Monte Carlo Transformer, an original approach that naturally captures the observations distribution in a transformer architecture. The keys, queries, values and attention vectors of the network are considered as the unobserved stochastic states of its hidden structure. This generative model is such that at each time step the received observation is a random fun…

    Submitted 15 December, 2020; v1 submitted 15 July, 2020; originally announced July 2020.

  10. arXiv:2006.12390  [pdf, other]

    eess.SP cs.LG stat.ML

    End-to-end deep metamodeling to calibrate and optimize energy loads

    Authors: Max Cohen, Maurice Charbit, Sylvain Le Corff, Marius Preda, Gilles Nozière

    Abstract: In this paper, we propose a new end-to-end methodology to optimize the energy performance and the comfort, air quality and hygiene of large buildings. A metamodel based on a Transformer network is introduced and trained using a dataset sampled with a simulation program. Then, a few physical parameters and the building management system settings of this metamodel are calibrated using the CMA-ES opt…

    Submitted 19 June, 2020; originally announced June 2020.