Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
Skip to main content

Showing 1–50 of 86 results for author: Durmus, A

.
  1. arXiv:2407.18609  [pdf, other

    cs.LG stat.ML

    Denoising Lévy Probabilistic Models

    Authors: Dario Shariatian, Umut Simsekli, Alain Durmus

    Abstract: Investigating noise distribution beyond Gaussian in diffusion generative models is an open problem. The Gaussian case has seen success experimentally and theoretically, fitting a unified SDE framework for score-based and denoising formulations. Recent studies suggest heavy-tailed noise distributions can address mode collapse and manage datasets with class imbalance, heavy tails, or outliers. Yoon… ▽ More

    Submitted 26 July, 2024; originally announced July 2024.

  2. arXiv:2407.14332  [pdf, ps, other

    cs.GT

    Unravelling in Collaborative Learning

    Authors: Aymeric Capitaine, Etienne Boursier, Antoine Scheid, Eric Moulines, Michael I. Jordan, El-Mahdi El-Mhamdi, Alain Durmus

    Abstract: Collaborative learning offers a promising avenue for leveraging decentralized data. However, collaboration in groups of strategic learners is not a given. In this work, we consider strategic agents who wish to train a model together but have sampling distributions of different quality. The collaboration is organized by a benevolent aggregator who gathers samples so as to maximize total welfare, bu… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

  3. arXiv:2406.19824  [pdf, ps, other

    cs.GT stat.ML

    Learning to Mitigate Externalities: the Coase Theorem with Hindsight Rationality

    Authors: Antoine Scheid, Aymeric Capitaine, Etienne Boursier, Eric Moulines, Michael I Jordan, Alain Durmus

    Abstract: In economic theory, the concept of externality refers to any indirect effect resulting from an interaction between players that affects the social welfare. Most of the models within which externality has been studied assume that agents have perfect knowledge of their environment and preferences. This is a major hindrance to the practical implementation of many proposed solutions. To address this i… ▽ More

    Submitted 3 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

  4. arXiv:2406.04012  [pdf, other

    stat.ML cs.LG

    Theoretical Guarantees for Variational Inference with Fixed-Variance Mixture of Gaussians

    Authors: Tom Huix, Anna Korba, Alain Durmus, Eric Moulines

    Abstract: Variational inference (VI) is a popular approach in Bayesian inference, that looks for the best approximation of the posterior distribution within a parametric family, minimizing a loss that is typically the (reverse) Kullback-Leibler (KL) divergence. Despite its empirical success, the theoretical properties of VI have only received attention recently, and mostly when the parametric family is the… ▽ More

    Submitted 10 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

  5. arXiv:2405.20636  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci

    Photoluminescence enhancement at the vertical van der Waals semiconductor-metal heterostructures

    Authors: Hafiz Muhammad Shakir, Abdulsalam Aji Suleiman, Kübra Nur Kalkan, Amir Parsi, Uğur Başçı, Mehmet Atıf Durmuş, Ahmet Osman Ölçer, Hilal Korkut, Cem Sevik, İbrahim Sarpkaya, Talip Serkan Kasırga

    Abstract: Excitons in monolayer transition metal dichalcogenides (TMDCs) offer intriguing new possibilities for optoelectronics with no analogues in bulk semiconductors. Yet, intrinsic defects in TMDCs limit the radiative exciton recombination pathways. As a result, the photoluminescence (PL) quantum yield (QY) is limited. Methods like superacid treatment, electrical doping, and plasmonic engineering can in… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  6. arXiv:2403.11407  [pdf, other

    stat.ML cs.LG

    Divide-and-Conquer Posterior Sampling for Denoising Diffusion Priors

    Authors: Yazid Janati, Alain Durmus, Eric Moulines, Jimmy Olsson

    Abstract: Interest in the use of Denoising Diffusion Models (DDM) as priors for solving inverse Bayesian problems has recently increased significantly. However, sampling from the resulting posterior distribution poses a challenge. To solve this problem, previous works have proposed approximations to bias the drift term of the diffusion. In this work, we take a different approach and utilize the specific str… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: preprint

  7. arXiv:2403.03811  [pdf, other

    stat.ML cs.GT cs.LG

    Incentivized Learning in Principal-Agent Bandit Games

    Authors: Antoine Scheid, Daniil Tiapkin, Etienne Boursier, Aymeric Capitaine, El Mahdi El Mhamdi, Eric Moulines, Michael I. Jordan, Alain Durmus

    Abstract: This work considers a repeated principal-agent bandit game, where the principal can only interact with her environment through the agent. The principal and the agent have misaligned objectives and the choice of action is only left to the agent. However, the principal can influence the agent's decisions by offering incentives which add up to his rewards. The principal aims to iteratively learn an i… ▽ More

    Submitted 6 March, 2024; originally announced March 2024.

  8. arXiv:2403.02506  [pdf, other

    cs.CV cs.LG

    Differentially Private Representation Learning via Image Captioning

    Authors: Tom Sander, Yaodong Yu, Maziar Sanjabi, Alain Durmus, Yi Ma, Kamalika Chaudhuri, Chuan Guo

    Abstract: Differentially private (DP) machine learning is considered the gold-standard solution for training a model from sensitive data while still preserving privacy. However, a major barrier to achieving this ideal is its sub-optimal privacy-accuracy trade-off, which is particularly visible in DP representation learning. Specifically, it has been shown that under modest privacy budgets, most models learn… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

  9. arXiv:2402.17870  [pdf, other

    stat.CO cs.LG math.OC stat.ML

    Stochastic Approximation with Biased MCMC for Expectation Maximization

    Authors: Samuel Gruffaz, Kyurae Kim, Alain Oliviero Durmus, Jacob R. Gardner

    Abstract: The expectation maximization (EM) algorithm is a widespread method for empirical Bayesian inference, but its expectation step (E-step) is often intractable. Employing a stochastic approximation scheme with Markov chain Monte Carlo (MCMC) can circumvent this issue, resulting in an algorithm known as MCMC-SAEM. While theoretical guarantees for MCMC-SAEM have previously been established, these result… ▽ More

    Submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted to AISTATS'24

  10. arXiv:2402.14904  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    Watermarking Makes Language Models Radioactive

    Authors: Tom Sander, Pierre Fernandez, Alain Durmus, Matthijs Douze, Teddy Furon

    Abstract: This paper investigates the radioactivity of LLM-generated texts, i.e. whether it is possible to detect that such input was used as training data. Conventional methods like membership inference can carry out this detection with some level of accuracy. We show that watermarked training data leaves traces easier to detect and much more reliable than membership inference. We link the contamination le… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

  11. arXiv:2402.10758  [pdf, other

    stat.ML cs.LG stat.CO

    Stochastic Localization via Iterative Posterior Sampling

    Authors: Louis Grenioux, Maxence Noble, Marylou Gabrié, Alain Oliviero Durmus

    Abstract: Building upon score-based learning, new interest in stochastic localization techniques has recently emerged. In these models, one seeks to noise a sample from the data distribution through a stochastic process, called observation process, and progressively learns a denoiser associated to this dynamics. Apart from specific applications, the use of stochastic localization for the problem of sampling… ▽ More

    Submitted 28 May, 2024; v1 submitted 16 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024

  12. arXiv:2402.08344  [pdf, other

    stat.ML cs.LG

    Implicit Bias in Noisy-SGD: With Applications to Differentially Private Training

    Authors: Tom Sander, Maxime Sylvestre, Alain Durmus

    Abstract: Training Deep Neural Networks (DNNs) with small batches using Stochastic Gradient Descent (SGD) yields superior test performance compared to larger batches. The specific noise structure inherent to SGD is known to be responsible for this implicit bias. DP-SGD, used to ensure differential privacy (DP) in DNNs' training, adds Gaussian noise to the clipped gradients. Surprisingly, large-batch trainin… ▽ More

    Submitted 13 February, 2024; originally announced February 2024.

  13. arXiv:2402.06447  [pdf, other

    math.OC math.PR

    On the irreducibility and convergence of a class of nonsmooth nonlinear state-space models on manifolds

    Authors: Armand Gissler, Alain Durmus, Anne Auger

    Abstract: In this paper, we analyze a large class of general nonlinear state-space models on a state-space X, defined by the recursion $φ_{k+1} = F(φ_k,α(φ_k,U_{k+1}))$, $k \in\bN$, where $F,α$ are some functions and $\{U_{k+1}\}_{k\in\bN}$ is a sequence of i.i.d. random variables. More precisely, we extend conditions under which this class of Markov chains is irreducible, aperiodic and satisfies important… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

  14. arXiv:2312.00417  [pdf, other

    stat.CO math.PR stat.AP

    Geodesic slice sampling on Riemannian manifolds

    Authors: Alain Durmus, Samuel Gruffaz, Mareike Hasenpflug, Daniel Rudolf

    Abstract: We propose a theoretically justified and practically applicable slice sampling based Markov chain Monte Carlo (MCMC) method for approximate sampling from probability measures on Riemannian manifolds. The latter naturally arise as posterior distributions in Bayesian inference of matrix-valued parameters, for example belonging to either the Stiefel or the Grassmann manifold. Our method, called geode… ▽ More

    Submitted 19 April, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: Journal paper of 51 pages with appendix

    MSC Class: 60-08 ACM Class: G.3

  15. arXiv:2310.18455  [pdf, other

    cs.LG stat.ML

    Approximate Heavy Tails in Offline (Multi-Pass) Stochastic Gradient Descent

    Authors: Krunoslav Lehman Pavasovic, Alain Durmus, Umut Simsekli

    Abstract: A recent line of empirical studies has demonstrated that SGD might exhibit a heavy-tailed behavior in practical settings, and the heaviness of the tails might correlate with the overall performance. In this paper, we investigate the emergence of such heavy tails. Previous works on this problem only considered, up to our knowledge, online (also called single-pass) SGD, in which the emergence of hea… ▽ More

    Submitted 27 October, 2023; originally announced October 2023.

    Comments: In Neural Information Processing Systems (NeurIPS), Spotlight Presentation, 2023

  16. arXiv:2308.12240  [pdf, ps, other

    math.ST stat.ML

    Score diffusion models without early stopping: finite Fisher information is all you need

    Authors: Giovanni Conforti, Alain Durmus, Marta Gentiloni Silveri

    Abstract: Diffusion models are a new class of generative models that revolve around the estimation of the score function associated with a stochastic differential equation. Subsequent to its acquisition, the approximated score function is then harnessed to simulate the corresponding time-reversal process, ultimately enabling the generation of approximate data samples. Despite their evident practical signifi… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

  17. arXiv:2307.10167  [pdf, other

    stat.ML cs.LG

    VITS : Variational Inference Thompson Sampling for contextual bandits

    Authors: Pierre Clavier, Tom Huix, Alain Durmus

    Abstract: In this paper, we introduce and analyze a variant of the Thompson sampling (TS) algorithm for contextual bandits. At each round, traditional TS requires samples from the current posterior distribution, which is usually intractable. To circumvent this issue, approximate inference techniques can be used and provide samples with distribution close to the posteriors. However, current approximate techn… ▽ More

    Submitted 20 July, 2024; v1 submitted 19 July, 2023; originally announced July 2023.

  18. arXiv:2307.03460  [pdf, other

    stat.CO math.PR math.ST stat.ML

    On the convergence of dynamic implementations of Hamiltonian Monte Carlo and No U-Turn Samplers

    Authors: Alain Durmus, Samuel Gruffaz, Miika Kailas, Eero Saksman, Matti Vihola

    Abstract: There is substantial empirical evidence about the success of dynamic implementations of Hamiltonian Monte Carlo (HMC), such as the No U-Turn Sampler (NUTS), in many challenging inference problems but theoretical results about their behavior are scarce. The aim of this paper is to fill this gap. More precisely, we consider a general class of MCMC algorithms we call dynamic HMC. We show that this ge… ▽ More

    Submitted 7 July, 2023; originally announced July 2023.

    Comments: 24 pages without appendix and references, 2 figures, a future journal paper

    MSC Class: 62

  19. arXiv:2306.09513  [pdf, ps, other

    math.PR math.NA

    Second order quantitative bounds for unadjusted generalized Hamiltonian Monte Carlo

    Authors: Evan Camrud, Alain Durmus, Pierre Monmarché, Gabriel Stoltz

    Abstract: This paper provides a convergence analysis for generalized Hamiltonian Monte Carlo samplers, a family of Markov Chain Monte Carlo methods based on leapfrog integration of Hamiltonian dynamics and kinetic Langevin diffusion, that encompasses the unadjusted Hamiltonian Monte Carlo method. Assuming that the target distribution $π$ satisfies a log-Sobolev inequality and mild conditions on the correspo… ▽ More

    Submitted 13 May, 2024; v1 submitted 15 June, 2023; originally announced June 2023.

  20. arXiv:2305.16557  [pdf, other

    stat.ML cs.LG math.PR

    Tree-Based Diffusion Schrödinger Bridge with Applications to Wasserstein Barycenters

    Authors: Maxence Noble, Valentin De Bortoli, Arnaud Doucet, Alain Durmus

    Abstract: Multi-marginal Optimal Transport (mOT), a generalization of OT, aims at minimizing the integral of a cost function with respect to a distribution with some prescribed marginals. In this paper, we consider an entropic version of mOT with a tree-structured quadratic cost, i.e., a function that can be written as a sum of pairwise cost functions between the nodes of a tree. To address this problem, we… ▽ More

    Submitted 28 October, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

  21. arXiv:2304.06549  [pdf, ps, other

    math.PR math.OC stat.ML

    Non-asymptotic convergence bounds for Sinkhorn iterates and their gradients: a coupling approach

    Authors: Giacomo Greco, Maxence Noble, Giovanni Conforti, Alain Durmus

    Abstract: Computational optimal transport (OT) has recently emerged as a powerful framework with applications in various fields. In this paper we focus on a relaxation of the original OT problem, the entropic OT problem, which allows to implement efficient and practical algorithmic solutions, even in high dimensional settings. This formulation, also known as the Schrödinger Bridge problem, notably connects… ▽ More

    Submitted 26 June, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: version accepted to COLT 2023

    MSC Class: 49Q22; 93E20 (Primary) 49N05; 90C25; 47D07

  22. arXiv:2304.04451  [pdf, ps, other

    math.PR math.OC

    Quantitative contraction rates for Sinkhorn algorithm: beyond bounded costs and compact marginals

    Authors: Giovanni Conforti, Alain Durmus, Giacomo Greco

    Abstract: We show non-asymptotic exponential convergence of Sinkhorn iterates to the Schrödinger potentials, solutions of the quadratic Entropic Optimal Transport problem on $\mathbb{R}^d$. Our results hold under mild assumptions on the marginal inputs: in particular, we only assume that they admit an asymptotically positive log-concavity profile, covering as special cases log-concave distributions and boun… ▽ More

    Submitted 20 June, 2024; v1 submitted 10 April, 2023; originally announced April 2023.

    Comments: 34 pages, simplified presentation of main results, added explicit expression for the exponential convergence rates and added stronger results in the log-concave setting

    MSC Class: 49Q22; 90C25 (Primary) 49N05; 93E20; 47D07 (Secondary)

  23. arXiv:2303.05838  [pdf, ps, other

    math.PR math.ST stat.ML

    Rosenthal-type inequalities for linear statistics of Markov chains

    Authors: Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov, Marina Sheshukova

    Abstract: In this paper, we establish novel deviation bounds for additive functionals of geometrically ergodic Markov chains similar to Rosenthal and Bernstein inequalities for sums of independent random variables. We pay special attention to the dependence of our bounds on the mixing time of the corresponding chain. More precisely, we establish explicit bounds that are linked to the constants from the mart… ▽ More

    Submitted 28 June, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    MSC Class: 60E15; 60J20; 65C40

  24. arXiv:2302.04763  [pdf, other

    stat.ML cs.LG

    On Sampling with Approximate Transport Maps

    Authors: Louis Grenioux, Alain Durmus, Éric Moulines, Marylou Gabrié

    Abstract: Transport maps can ease the sampling of distributions with non-trivial geometries by transforming them into distributions that are easier to handle. The potential of this approach has risen with the development of Normalizing Flows (NF) which are maps parameterized with deep neural networks trained to push a reference distribution towards a target. NF-enhanced samplers recently proposed blend (Mar… ▽ More

    Submitted 18 February, 2024; v1 submitted 9 February, 2023; originally announced February 2023.

  25. arXiv:2301.02446  [pdf, other

    stat.CO math.PR math.ST

    Optimal Scaling Results for Moreau-Yosida Metropolis-adjusted Langevin Algorithms

    Authors: Francesca R. Crucinio, Alain Durmus, Pablo Jiménez, Gareth O. Roberts

    Abstract: We consider a recently proposed class of MCMC methods which uses proximity maps instead of gradients to build proposal mechanisms which can be employed for both differentiable and non-differentiable targets. These methods have been shown to be stable for a wide class of targets, making them a valuable alternative to Metropolis-adjusted Langevin algorithms (MALA); and have found wide application in… ▽ More

    Submitted 19 June, 2024; v1 submitted 6 January, 2023; originally announced January 2023.

    MSC Class: 65C05; 60F05

  26. arXiv:2211.00100  [pdf, other

    stat.ML cs.LG

    Federated Averaging Langevin Dynamics: Toward a unified theory and new algorithms

    Authors: Vincent Plassier, Alain Durmus, Eric Moulines

    Abstract: This paper focuses on Bayesian inference in a federated learning context (FL). While several distributed MCMC algorithms have been proposed, few consider the specific limitations of FL such as communication bottlenecks and statistical heterogeneity. Recently, Federated Averaging Langevin Dynamics (FALD) was introduced, which extends the Federated Averaging algorithm to Bayesian inference. We obtai… ▽ More

    Submitted 31 October, 2022; originally announced November 2022.

    Comments: 58 pages

  27. arXiv:2210.11925  [pdf, other

    stat.ML cs.LG math.PR

    Unbiased constrained sampling with Self-Concordant Barrier Hamiltonian Monte Carlo

    Authors: Maxence Noble, Valentin De Bortoli, Alain Durmus

    Abstract: In this paper, we propose Barrier Hamiltonian Monte Carlo (BHMC), a version of the HMC algorithm which aims at sampling from a Gibbs distribution $π$ on a manifold $\mathrm{M}$, endowed with a Hessian metric $\mathfrak{g}$ derived from a self-concordant barrier. Our method relies on Hamiltonian dynamics which comprises $\mathfrak{g}$. Therefore, it incorporates the constraints defining… ▽ More

    Submitted 28 October, 2023; v1 submitted 21 October, 2022; originally announced October 2022.

  28. arXiv:2207.04475  [pdf, ps, other

    stat.ML cs.LG math.PR math.ST

    Finite-time High-probability Bounds for Polyak-Ruppert Averaged Iterates of Linear Stochastic Approximation

    Authors: Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov

    Abstract: This paper provides a finite-time analysis of linear stochastic approximation (LSA) algorithms with fixed step size, a core method in statistics and machine learning. LSA is used to compute approximate solutions of a $d$-dimensional linear system $\bar{\mathbf{A}} θ= \bar{\mathbf{b}}$ for which $(\bar{\mathbf{A}}, \bar{\mathbf{b}})$ can only be estimated by (asymptotically) unbiased observations… ▽ More

    Submitted 29 March, 2023; v1 submitted 10 July, 2022; originally announced July 2022.

    MSC Class: 62L20; 60J20

  29. arXiv:2207.03859  [pdf, other

    stat.ML cs.LG

    Variational Inference of overparameterized Bayesian Neural Networks: a theoretical and empirical study

    Authors: Tom Huix, Szymon Majewski, Alain Durmus, Eric Moulines, Anna Korba

    Abstract: This paper studies the Variational Inference (VI) used for training Bayesian Neural Networks (BNN) in the overparameterized regime, i.e., when the number of neurons tends to infinity. More specifically, we consider overparameterized two-layer BNN and point out a critical issue in the mean-field VI training. This problem arises from the decomposition of the lower bound on the evidence (ELBO) into t… ▽ More

    Submitted 8 July, 2022; originally announced July 2022.

  30. arXiv:2206.03611  [pdf, other

    cs.LG stat.ME stat.ML

    FedPop: A Bayesian Approach for Personalised Federated Learning

    Authors: Nikita Kotelevskii, Maxime Vono, Eric Moulines, Alain Durmus

    Abstract: Personalised federated learning (FL) aims at collaboratively learning a machine learning model taylored for each client. Albeit promising advances have been made in this direction, most of existing approaches works do not allow for uncertainty quantification which is crucial in many applications. In addition, personalisation in the cross-device setting still involves important issues, especially f… ▽ More

    Submitted 26 January, 2023; v1 submitted 7 June, 2022; originally announced June 2022.

  31. arXiv:2201.07652  [pdf, ps, other

    math.PR

    Sticky nonlinear SDEs and convergence of McKean-Vlasov equations without confinement

    Authors: Alain Durmus, Andreas Eberle, Arnaud Guillin, Katharina Schuh

    Abstract: We develop a new approach to study the long time behaviour of solutions to nonlinear stochastic differential equations in the sense of McKean, as well as propagation of chaos for the corresponding mean-field particle system approximations. Our approach is based on a sticky coupling between two solutions to the equation. We show that the distance process between the two copies is dominated by a sol… ▽ More

    Submitted 13 November, 2022; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: 46 pages

    MSC Class: 60H10 (Primary); 60J60; 82C31 (Secondary)

  32. arXiv:2201.06133  [pdf, other

    stat.ML cs.CV cs.LG eess.IV math.OC

    On Maximum-a-Posteriori estimation with Plug & Play priors and stochastic gradient descent

    Authors: Rémi Laumont, Valentin de Bortoli, Andrés Almansa, Julie Delon, Alain Durmus, Marcelo Pereyra

    Abstract: Bayesian methods to solve imaging inverse problems usually combine an explicit data likelihood function with a prior distribution that explicitly models expected properties of the solution. Many kinds of priors have been explored in the literature, from simple ones expressing local properties to more involved ones exploiting image redundancy at a non-local scale. In a departure from explicit model… ▽ More

    Submitted 16 January, 2022; originally announced January 2022.

    MSC Class: 65K10 (Primary) 65K05; 62F15; 62C10; 68Q25; 68U10; 90C26 (Secondary) 65K10; 65K05; 62F15; 62C10; 68Q25; 68U10; 90C26

  33. arXiv:2201.05002  [pdf, other

    stat.CO math.NA math.PR

    Boost your favorite Markov Chain Monte Carlo sampler using Kac's theorem: the Kick-Kac teleportation algorithm

    Authors: Randal Douc, Alain Durmus, Aurélien Enfroy, Jimmy Olsson

    Abstract: The present paper focuses on the problem of sampling from a given target distribution $π$ defined on some general state space. To this end, we introduce a novel class of non-reversible Markov chains, each chain being defined on an extended state space and having an invariant probability measure admitting $π$ as a marginal distribution. The proposed methodology is inspired by a new formulation of K… ▽ More

    Submitted 13 May, 2023; v1 submitted 13 January, 2022; originally announced January 2022.

    MSC Class: 62-08; 60J20; 65C05 (Primary); 60F15 (Secondary)

  34. arXiv:2201.01951  [pdf, ps, other

    stat.CO math.NA math.PR

    On the geometric convergence for MALA under verifiable conditions

    Authors: Alain Durmus, Éric Moulines

    Abstract: While the Metropolis Adjusted Langevin Algorithm (MALA) is a popular and widely used Markov chain Monte Carlo method, very few papers derive conditions that ensure its convergence. In particular, to the authors' knowledge, assumptions that are both easy to verify and guarantee geometric convergence, are still missing. In this work, we establish $V$-uniformly geometric convergence for MALA under mi… ▽ More

    Submitted 6 January, 2022; originally announced January 2022.

  35. arXiv:2111.02702  [pdf, other

    stat.ML cs.LG

    Local-Global MCMC kernels: the best of both worlds

    Authors: Sergey Samsonov, Evgeny Lagutin, Marylou Gabrié, Alain Durmus, Alexey Naumov, Eric Moulines

    Abstract: Recent works leveraging learning to enhance sampling have shown promising results, in particular by designing effective non-local moves and global proposals. However, learning accuracy is inevitably limited in regions where little data is available such as in the tails of distributions as well as in high-dimensional problems. In the present paper we study an Explore-Exploit Markov chain Monte Carl… ▽ More

    Submitted 4 October, 2022; v1 submitted 4 November, 2021; originally announced November 2021.

    Comments: arXiv admin note: text overlap with arXiv:1111.5421 by other authors

  36. arXiv:2109.00331  [pdf, ps, other

    math.PR

    Probability and moment inequalities for additive functionals of geometrically ergodic Markov chains

    Authors: Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov

    Abstract: In this paper, we establish moment and Bernstein-type inequalities for additive functionals of geometrically ergodic Markov chains. These inequalities extend the corresponding inequalities for independent random variables. Our conditions cover Markov chains converging geometrically to the stationary distribution either in $V$-norms or in weighted Wasserstein distances. Our inequalities apply to un… ▽ More

    Submitted 15 June, 2023; v1 submitted 1 September, 2021; originally announced September 2021.

    MSC Class: 60E15; 60J20; 65C40

  37. arXiv:2108.00682  [pdf, other

    math.PR math.NA stat.CO stat.ML

    Asymptotic bias of inexact Markov Chain Monte Carlo methods in high dimension

    Authors: Alain Oliviero Durmus, Andreas Eberle

    Abstract: Inexact Markov Chain Monte Carlo methods rely on Markov chains that do not exactly preserve the target distribution. Examples include the unadjusted Langevin algorithm (ULA) and unadjusted Hamiltonian Monte Carlo (uHMC). This paper establishes bounds on Wasserstein distances between the invariant probability measures of inexact MCMC methods and their target distributions with a focus on understand… ▽ More

    Submitted 12 April, 2023; v1 submitted 2 August, 2021; originally announced August 2021.

  38. arXiv:2107.14542  [pdf, ps, other

    math.PR math.NA stat.CO

    Uniform minorization condition and convergence bounds for discretizations of kinetic Langevin dynamics

    Authors: Alain Durmus, Aurélien Enfroy, Éric Moulines, Gabriel Stoltz

    Abstract: We study the convergence in total variation and $V$-norm of discretization schemes of the underdamped Langevin dynamics. Such algorithms are very popular and commonly used in molecular dynamics and computational statistics to approximatively sample from a target distribution of interest. We show first that, for a very large class of schemes, a minorization condition uniform in the stepsize holds.… ▽ More

    Submitted 21 April, 2023; v1 submitted 30 July, 2021; originally announced July 2021.

  39. arXiv:2106.15921  [pdf, other

    stat.ML cs.LG

    Monte Carlo Variational Auto-Encoders

    Authors: Achille Thin, Nikita Kotelevskii, Arnaud Doucet, Alain Durmus, Eric Moulines, Maxim Panov

    Abstract: Variational auto-encoders (VAE) are popular deep latent variable models which are trained by maximizing an Evidence Lower Bound (ELBO). To obtain tighter ELBO and hence better variational approximations, it has been proposed to use importance sampling to get a lower variance estimate of the evidence. However, importance sampling is known to perform poorly in high dimensions. While it has been sugg… ▽ More

    Submitted 30 June, 2021; originally announced June 2021.

  40. arXiv:2106.15427  [pdf, other

    stat.ML cs.LG

    Fast Approximation of the Sliced-Wasserstein Distance Using Concentration of Random Projections

    Authors: Kimia Nadjahi, Alain Durmus, Pierre E. Jacob, Roland Badeau, Umut Şimşekli

    Abstract: The Sliced-Wasserstein distance (SW) is being increasingly used in machine learning applications as an alternative to the Wasserstein distance and offers significant computational and statistical benefits. Since it is defined as an expectation over random projections, SW is commonly approximated by Monte Carlo. We adopt a new perspective to approximate SW by making use of the concentration of meas… ▽ More

    Submitted 4 January, 2022; v1 submitted 29 June, 2021; originally announced June 2021.

    Comments: Published at NeurIPS 2021

  41. arXiv:2106.06300  [pdf, other

    stat.ME cs.AI cs.LG stat.CO

    DG-LMC: A Turn-key and Scalable Synchronous Distributed MCMC Algorithm via Langevin Monte Carlo within Gibbs

    Authors: Vincent Plassier, Maxime Vono, Alain Durmus, Eric Moulines

    Abstract: Performing reliable Bayesian inference on a big data scale is becoming a keystone in the modern era of machine learning. A workhorse class of methods to achieve this task are Markov chain Monte Carlo (MCMC) algorithms and their design to handle distributed datasets has been the subject of many works. However, existing methods are not completely either reliable or computationally efficient. In this… ▽ More

    Submitted 18 June, 2021; v1 submitted 11 June, 2021; originally announced June 2021.

    Comments: 77 pages. Accepted for publication at ICML 2021, to appear

  42. arXiv:2106.01257  [pdf, ps, other

    stat.ML cs.LG math.PR math.ST

    Tight High Probability Bounds for Linear Stochastic Approximation with Fixed Stepsize

    Authors: Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov, Kevin Scaman, Hoi-To Wai

    Abstract: This paper provides a non-asymptotic analysis of linear stochastic approximation (LSA) algorithms with fixed stepsize. This family of methods arises in many machine learning tasks and is used to obtain approximate solutions of a linear system $\bar{A}θ= \bar{b}$ for which $\bar{A}$ and $\bar{b}$ can only be accessed through random estimates $\{({\bf A}_n, {\bf b}_n): n \in \mathbb{N}^*\}$. Our ana… ▽ More

    Submitted 2 June, 2021; originally announced June 2021.

    Comments: 21 pages

  43. arXiv:2106.00797  [pdf, other

    cs.LG cs.AI stat.CO stat.ME stat.ML

    QLSD: Quantised Langevin stochastic dynamics for Bayesian federated learning

    Authors: Maxime Vono, Vincent Plassier, Alain Durmus, Aymeric Dieuleveut, Eric Moulines

    Abstract: The objective of Federated Learning (FL) is to perform statistical inference for data which are decentralised and stored locally on networked clients. FL raises many constraints which include privacy and data ownership, communication overhead, statistical heterogeneity, and partial client participation. In this paper, we address these problems in the framework of the Bayesian paradigm. To this end… ▽ More

    Submitted 31 May, 2022; v1 submitted 1 June, 2021; originally announced June 2021.

  44. arXiv:2104.06771  [pdf, other

    math.PR stat.CO

    Discrete sticky couplings of functional autoregressive processes

    Authors: Alain Durmus, Andreas Eberle, Aurélien Enfroy, Arnaud Guillin, Pierre Monmarché

    Abstract: In this paper, we provide bounds in Wasserstein and total variation distances between the distributions of the successive iterates of two functional autoregressive processes with isotropic Gaussian noise of the form $Y_{k+1} = \mathrm{T}_γ(Y_k) + \sqrt{γσ^2} Z_{k+1}$ and $\tilde{Y}_{k+1} = \tilde{\mathrm{T}}_γ(\tilde{Y}_k) + \sqrt{γσ^2} \tilde{Z}_{k+1}$. More precisely, we give non-asymptotic boun… ▽ More

    Submitted 28 November, 2023; v1 submitted 14 April, 2021; originally announced April 2021.

  45. arXiv:2103.10943  [pdf, other

    stat.CO stat.ME stat.ML

    NEO: Non Equilibrium Sampling on the Orbit of a Deterministic Transform

    Authors: Achille Thin, Yazid Janati, Sylvain Le Corff, Charles Ollion, Arnaud Doucet, Alain Durmus, Eric Moulines, Christian Robert

    Abstract: Sampling from a complex distribution $π$ and approximating its intractable normalizing constant Z are challenging problems. In this paper, a novel family of importance samplers (IS) and Markov chain Monte Carlo (MCMC) samplers is derived. Given an invertible map T, these schemes combine (with weights) elements from the forward and backward Orbits through points sampled from a proposal distributi… ▽ More

    Submitted 23 August, 2021; v1 submitted 17 March, 2021; originally announced March 2021.

  46. arXiv:2103.04715  [pdf, other

    stat.ME cs.CV eess.IV math.ST stat.ML

    Bayesian imaging using Plug & Play priors: when Langevin meets Tweedie

    Authors: Rémi Laumont, Valentin de Bortoli, Andrés Almansa, Julie Delon, Alain Durmus, Marcelo Pereyra

    Abstract: Since the seminal work of Venkatakrishnan et al. in 2013, Plug & Play (PnP) methods have become ubiquitous in Bayesian imaging. These methods derive Minimum Mean Square Error (MMSE) or Maximum A Posteriori (MAP) estimators for inverse problems in imaging by combining an explicit likelihood function with a prior that is implicitly defined by an image denoising algorithm. The PnP algorithms proposed… ▽ More

    Submitted 12 January, 2022; v1 submitted 8 March, 2021; originally announced March 2021.

    MSC Class: 65K10; 65K05; 65D18; 62F15; 62C10; 68Q25; 68U10; 90C26

  47. arXiv:2102.07586  [pdf, other

    stat.ML cs.LG math.PR

    On Riemannian Stochastic Approximation Schemes with Fixed Step-Size

    Authors: Alain Durmus, Pablo Jiménez, Éric Moulines, Salem Said

    Abstract: This paper studies fixed step-size stochastic approximation (SA) schemes, including stochastic gradient schemes, in a Riemannian framework. It is motivated by several applications, where geodesics can be computed explicitly, and their use accelerates crude Euclidean methods. A fixed step-size scheme defines a family of time-homogeneous Markov chains, parametrized by the step-size. Here, using this… ▽ More

    Submitted 19 February, 2021; v1 submitted 15 February, 2021; originally announced February 2021.

    Comments: 37 pages, 4 figures, to appear in AISTAT21

    MSC Class: 60F05

  48. arXiv:2102.00185  [pdf, ps, other

    stat.ML cs.LG math.PR math.ST

    On the Stability of Random Matrix Product with Markovian Noise: Application to Linear Stochastic Approximation and TD Learning

    Authors: Alain Durmus, Eric Moulines, Alexey Naumov, Sergey Samsonov, Hoi-To Wai

    Abstract: This paper studies the exponential stability of random matrix products driven by a general (possibly unbounded) state space Markov chain. It is a cornerstone in the analysis of stochastic algorithms in machine learning (e.g. for parameter tracking in online learning or reinforcement learning). The existing results impose strong conditions such as uniform boundedness of the matrix-valued functions… ▽ More

    Submitted 30 January, 2021; originally announced February 2021.

  49. arXiv:2012.15550  [pdf, ps, other

    stat.CO stat.ML

    Nonreversible MCMC from conditional invertible transforms: a complete recipe with convergence guarantees

    Authors: Achille Thin, Nikita Kotelevskii, Christophe Andrieu, Alain Durmus, Eric Moulines, Maxim Panov

    Abstract: Markov Chain Monte Carlo (MCMC) is a class of algorithms to sample complex and high-dimensional probability distributions. The Metropolis-Hastings (MH) algorithm, the workhorse of MCMC, provides a simple recipe to construct reversible Markov kernels. Reversibility is a tractable property that implies a less tractable but essential property here, invariance. Reversibility is however not necessarily… ▽ More

    Submitted 29 March, 2021; v1 submitted 31 December, 2020; originally announced December 2020.

  50. arXiv:2008.05793  [pdf, ps, other

    math.ST math.PR stat.CO

    Maximum likelihood estimation of regularisation parameters in high-dimensional inverse problems: an empirical Bayesian approach. Part II: Theoretical Analysis

    Authors: Valentin De Bortoli, Alain Durmus, Ana F. Vidal, Marcelo Pereyra

    Abstract: This paper presents a detailed theoretical analysis of the three stochastic approximation proximal gradient algorithms proposed in our companion paper [49] to set regularization parameters by marginal maximum likelihood estimation. We prove the convergence of a more general stochastic approximation scheme that includes the three algorithms of [49] as special cases. This includes asymptotic and non… ▽ More

    Submitted 13 August, 2020; originally announced August 2020.

    Comments: SIIMS 2020 - 30 pages