Showing 1–27 of 27 results for author: Kingma, D P

Searching in archive cs.
  1. arXiv:2405.16852  [pdf, other]

    cs.LG cs.AI stat.ML

    EM Distillation for One-step Diffusion Models

    Authors: Sirui Xie, Zhisheng Xiao, Diederik P Kingma, Tingbo Hou, Ying Nian Wu, Kevin Patrick Murphy, Tim Salimans, Ben Poole, Ruiqi Gao

    Abstract: While diffusion models can learn complex distributions, sampling requires a computationally expensive iterative process. Existing distillation methods enable efficient sampling, but have notable limitations, such as performance degradation with very few sampling steps, reliance on training data access, or mode-seeking optimization that may fail to capture the full distribution. We propose EM Disti…

    Submitted 27 May, 2024; originally announced May 2024.

  2. arXiv:2303.00848  [pdf, other]

    cs.LG cs.AI stat.ML

    Understanding Diffusion Objectives as the ELBO with Simple Data Augmentation

    Authors: Diederik P. Kingma, Ruiqi Gao

    Abstract: To achieve the highest perceptual quality, state-of-the-art diffusion models are optimized with objectives that typically look very different from the maximum likelihood and the Evidence Lower Bound (ELBO) objectives. In this work, we reveal that diffusion model objectives are actually closely related to the ELBO. Specifically, we show that all commonly used diffusion model objectives equate to…

    Submitted 25 September, 2023; v1 submitted 1 March, 2023; originally announced March 2023.

  3. arXiv:2210.03142  [pdf, other]

    cs.CV cs.AI cs.LG

    On Distillation of Guided Diffusion Models

    Authors: Chenlin Meng, Robin Rombach, Ruiqi Gao, Diederik P. Kingma, Stefano Ermon, Jonathan Ho, Tim Salimans

    Abstract: Classifier-free guided diffusion models have recently been shown to be highly effective at high-resolution image generation, and they have been widely used in large-scale diffusion frameworks including DALLE-2, Stable Diffusion and Imagen. However, a downside of classifier-free guided diffusion models is that they are computationally expensive at inference time since they require evaluating two di…

    Submitted 12 April, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: CVPR 2023, Award candidate

  4. arXiv:2210.02303  [pdf, other]

    cs.CV cs.LG

    Imagen Video: High Definition Video Generation with Diffusion Models

    Authors: Jonathan Ho, William Chan, Chitwan Saharia, Jay Whang, Ruiqi Gao, Alexey Gritsenko, Diederik P. Kingma, Ben Poole, Mohammad Norouzi, David J. Fleet, Tim Salimans

    Abstract: We present Imagen Video, a text-conditional video generation system based on a cascade of video diffusion models. Given a text prompt, Imagen Video generates high definition videos using a base video generation model and a sequence of interleaved spatial and temporal video super-resolution models. We describe how we scale up the system as a high definition text-to-video model including design deci…

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: See accompanying website: https://imagen.research.google/video/

  5. arXiv:2107.00630  [pdf, other]

    cs.LG stat.ML

    Variational Diffusion Models

    Authors: Diederik P. Kingma, Tim Salimans, Ben Poole, Jonathan Ho

    Abstract: Diffusion-based generative models have demonstrated a capacity for perceptually impressive synthesis, but can they also be great likelihood-based models? We answer this in the affirmative, and introduce a family of diffusion-based generative models that obtain state-of-the-art likelihoods on standard image density estimation benchmarks. Unlike other diffusion-based models, our method allows for ef…

    Submitted 13 April, 2023; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: Published at NeurIPS'21

  6. arXiv:2101.03288  [pdf, other]

    cs.LG stat.ML

    How to Train Your Energy-Based Models

    Authors: Yang Song, Diederik P. Kingma

    Abstract: Energy-Based Models (EBMs), also known as non-normalized probabilistic models, specify probability density or mass functions up to an unknown normalizing constant. Unlike most other probabilistic models, EBMs do not place a restriction on the tractability of the normalizing constant, thus are more flexible to parameterize and can model a more expressive family of probability distributions. However…

    Submitted 17 February, 2021; v1 submitted 8 January, 2021; originally announced January 2021.

  7. arXiv:2012.08125  [pdf, other]

    cs.LG stat.ML

    Learning Energy-Based Models by Diffusion Recovery Likelihood

    Authors: Ruiqi Gao, Yang Song, Ben Poole, Ying Nian Wu, Diederik P. Kingma

    Abstract: While energy-based models (EBMs) exhibit a number of desirable properties, training and sampling on high-dimensional datasets remains challenging. Inspired by recent progress on diffusion probabilistic models, we present a diffusion recovery likelihood method to tractably learn and sample from a sequence of EBMs trained on increasingly noisy versions of a dataset. Each EBM is trained with recovery…

    Submitted 27 March, 2021; v1 submitted 15 December, 2020; originally announced December 2020.

  8. arXiv:2011.13456  [pdf, other]

    cs.LG stat.ML

    Score-Based Generative Modeling through Stochastic Differential Equations

    Authors: Yang Song, Jascha Sohl-Dickstein, Diederik P. Kingma, Abhishek Kumar, Stefano Ermon, Ben Poole

    Abstract: Creating noise from data is easy; creating data from noise is generative modeling. We present a stochastic differential equation (SDE) that smoothly transforms a complex data distribution to a known prior distribution by slowly injecting noise, and a corresponding reverse-time SDE that transforms the prior distribution back into the data distribution by slowly removing the noise. Crucially, the re…

    Submitted 10 February, 2021; v1 submitted 26 November, 2020; originally announced November 2020.

    Comments: ICLR 2021 (Oral)

  9. arXiv:2011.03568  [pdf, other]

    cs.CL cs.SD eess.AS

    Wave-Tacotron: Spectrogram-free end-to-end text-to-speech synthesis

    Authors: Ron J. Weiss, RJ Skerry-Ryan, Eric Battenberg, Soroosh Mariooryad, Diederik P. Kingma

    Abstract: We describe a sequence-to-sequence neural network which directly generates speech waveforms from text inputs. The architecture extends the Tacotron model by incorporating a normalizing flow into the autoregressive decoder loop. Output waveforms are modeled as a sequence of non-overlapping fixed-length blocks, each one containing hundreds of samples. The interdependencies of waveform samples within…

    Submitted 5 February, 2021; v1 submitted 6 November, 2020; originally announced November 2020.

    Comments: 6 pages including supplement, 3 figures. accepted to ICASSP 2021

  10. arXiv:2007.00810  [pdf, other]

    stat.ML cs.LG

    On Linear Identifiability of Learned Representations

    Authors: Geoffrey Roeder, Luke Metz, Diederik P. Kingma

    Abstract: Identifiability is a desirable property of a statistical model: it implies that the true model parameters may be estimated to any desired precision, given sufficient computational resources and data. We study identifiability in the context of representation learning: discovering nonlinear data representations that are optimal with respect to some downstream task. When parameterized as deep neural…

    Submitted 7 July, 2020; v1 submitted 1 July, 2020; originally announced July 2020.

  11. arXiv:2002.11537  [pdf, other]

    stat.ML cs.LG

    ICE-BeeM: Identifiable Conditional Energy-Based Deep Models Based on Nonlinear ICA

    Authors: Ilyes Khemakhem, Ricardo Pio Monti, Diederik P. Kingma, Aapo Hyvärinen

    Abstract: We consider the identifiability theory of probabilistic models and establish sufficient conditions under which the representations learned by a very broad family of conditional energy-based models are unique in function space, up to a simple transformation. In our model family, the energy function is the dot-product between two feature extractors, one for the dependent variable, and one for the co…

    Submitted 26 October, 2020; v1 submitted 26 February, 2020; originally announced February 2020.

    Comments: Accepted for publication at NeurIPS 2020

  12. arXiv:1912.00589  [pdf, other]

    stat.ML cs.CV cs.LG

    Flow Contrastive Estimation of Energy-Based Models

    Authors: Ruiqi Gao, Erik Nijkamp, Diederik P. Kingma, Zhen Xu, Andrew M. Dai, Ying Nian Wu

    Abstract: This paper studies a training method to jointly estimate an energy-based model and a flow-based model, in which the two models are iteratively updated based on a shared adversarial value function. This joint training method has the following traits. (1) The update of the energy-based model is based on noise contrastive estimation, with the flow model serving as a strong noise distribution. (2) The…

    Submitted 1 April, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

  13. arXiv:1907.04809  [pdf, other]

    stat.ML cs.LG

    Variational Autoencoders and Nonlinear ICA: A Unifying Framework

    Authors: Ilyes Khemakhem, Diederik P. Kingma, Ricardo Pio Monti, Aapo Hyvärinen

    Abstract: The framework of variational autoencoders allows us to efficiently learn deep latent-variable models, such that the model's marginal distribution over observed variables fits the data. Often, we're interested in going a step further, and want to approximate the true joint distribution over observed and latent variables, including the true prior and posterior distributions over latent variables. Th…

    Submitted 21 December, 2020; v1 submitted 10 July, 2019; originally announced July 2019.

    Comments: Accepted for publication at AISTATS 2020. This is a slightly updated version of the published manuscript; see Corrigendum at the end of the paper

    Journal ref: Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, pages 2207-2217, year 2020

  14. arXiv:1906.02691  [pdf, other]

    cs.LG stat.ML

    An Introduction to Variational Autoencoders

    Authors: Diederik P. Kingma, Max Welling

    Abstract: Variational autoencoders provide a principled framework for learning deep latent-variable models and corresponding inference models. In this work, we provide an introduction to variational autoencoders and some important extensions.

    Submitted 11 December, 2019; v1 submitted 6 June, 2019; originally announced June 2019.

    Journal ref: Foundations and Trends in Machine Learning: Vol. 12 (2019): No. 4, pp 307-392

  15. arXiv:1807.03039  [pdf, other]

    stat.ML cs.AI cs.LG

    Glow: Generative Flow with Invertible 1x1 Convolutions

    Authors: Diederik P. Kingma, Prafulla Dhariwal

    Abstract: Flow-based generative models (Dinh et al., 2014) are conceptually attractive due to tractability of the exact log-likelihood, tractability of exact latent-variable inference, and parallelizability of both training and synthesis. In this paper we propose Glow, a simple type of generative flow using an invertible 1x1 convolution. Using our method we demonstrate a significant improvement in log-likel…

    Submitted 10 July, 2018; v1 submitted 9 July, 2018; originally announced July 2018.

    Comments: 15 pages; fixed typo in abstract

  16. arXiv:1712.01312  [pdf, other]

    stat.ML cs.LG

    Learning Sparse Neural Networks through $L_0$ Regularization

    Authors: Christos Louizos, Max Welling, Diederik P. Kingma

    Abstract: We propose a practical method for $L_0$ norm regularization for neural networks: pruning the network during training by encouraging weights to become exactly zero. Such regularization is interesting since (1) it can greatly speed up training and inference, and (2) it can improve generalization. AIC and BIC, well-known model selection criteria, are special cases of $L_0$ regularization. However, si…

    Submitted 22 June, 2018; v1 submitted 4 December, 2017; originally announced December 2017.

    Comments: Published as a conference paper at the International Conference on Learning Representations (ICLR) 2018

  17. arXiv:1701.05517  [pdf, other]

    cs.LG stat.ML

    PixelCNN++: Improving the PixelCNN with Discretized Logistic Mixture Likelihood and Other Modifications

    Authors: Tim Salimans, Andrej Karpathy, Xi Chen, Diederik P. Kingma

    Abstract: PixelCNNs are a recently proposed class of powerful generative models with tractable likelihood. Here we discuss our implementation of PixelCNNs which we make available at https://github.com/openai/pixel-cnn. Our implementation contains a number of modifications to the original model that both simplify its structure and improve its performance. 1) We use a discretized logistic mixture likelihood o…

    Submitted 19 January, 2017; originally announced January 2017.

  18. arXiv:1611.02731  [pdf, other]

    cs.LG stat.ML

    Variational Lossy Autoencoder

    Authors: Xi Chen, Diederik P. Kingma, Tim Salimans, Yan Duan, Prafulla Dhariwal, John Schulman, Ilya Sutskever, Pieter Abbeel

    Abstract: Representation learning seeks to expose certain aspects of observed data in a learned representation that's amenable to downstream tasks like classification. For instance, a good representation for 2D images might be one that describes only global structure and discards information about detailed texture. In this paper, we present a simple but principled method to learn such global representations…

    Submitted 4 March, 2017; v1 submitted 8 November, 2016; originally announced November 2016.

    Comments: Added CIFAR10 experiments; ICLR 2017

  19. arXiv:1606.04934  [pdf, other]

    cs.LG stat.ML

    Improving Variational Inference with Inverse Autoregressive Flow

    Authors: Diederik P. Kingma, Tim Salimans, Rafal Jozefowicz, Xi Chen, Ilya Sutskever, Max Welling

    Abstract: The framework of normalizing flows provides a general strategy for flexible variational inference of posteriors over latent variables. We propose a new type of normalizing flow, inverse autoregressive flow (IAF), that, in contrast to earlier published flows, scales well to high-dimensional latent spaces. The proposed flow consists of a chain of invertible transformations, where each transformation…

    Submitted 30 January, 2017; v1 submitted 15 June, 2016; originally announced June 2016.

  20. arXiv:1602.07868  [pdf, other]

    cs.LG cs.AI cs.NE

    Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks

    Authors: Tim Salimans, Diederik P. Kingma

    Abstract: We present weight normalization: a reparameterization of the weight vectors in a neural network that decouples the length of those weight vectors from their direction. By reparameterizing the weights in this way we improve the conditioning of the optimization problem and we speed up convergence of stochastic gradient descent. Our reparameterization is inspired by batch normalization but does not i…

    Submitted 3 June, 2016; v1 submitted 25 February, 2016; originally announced February 2016.

  21. arXiv:1506.02557  [pdf, other]

    stat.ML cs.LG stat.CO

    Variational Dropout and the Local Reparameterization Trick

    Authors: Diederik P. Kingma, Tim Salimans, Max Welling

    Abstract: We investigate a local reparameterization technique for greatly reducing the variance of stochastic gradients for variational Bayesian inference (SGVB) of a posterior over model parameters, while retaining parallelizability. This local reparameterization translates uncertainty about global parameters into local noise that is independent across datapoints in the minibatch. Such parameterizations can…

    Submitted 20 December, 2015; v1 submitted 8 June, 2015; originally announced June 2015.

  22. arXiv:1504.08025  [pdf, ps, other]


    Note on Equivalence Between Recurrent Neural Network Time Series Models and Variational Bayesian Models

    Authors: Jascha Sohl-Dickstein, Diederik P. Kingma

    Abstract: We observe that the standard log likelihood training objective for a Recurrent Neural Network (RNN) model of time series data is equivalent to a variational Bayesian training objective, given the proper choice of generative and inference models. This perspective may motivate extensions to both RNNs and variational Bayesian models. We propose one such extension, where multiple particles are used fo…

    Submitted 18 June, 2016; v1 submitted 29 April, 2015; originally announced April 2015.

  23. arXiv:1412.6980  [pdf, other]


    Adam: A Method for Stochastic Optimization

    Authors: Diederik P. Kingma, Jimmy Ba

    Abstract: We introduce Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments. The method is straightforward to implement, is computationally efficient, has little memory requirements, is invariant to diagonal rescaling of the gradients, and is well suited for problems that are large in terms of data and/or paramet…

    Submitted 29 January, 2017; v1 submitted 22 December, 2014; originally announced December 2014.

    Comments: Published as a conference paper at the 3rd International Conference for Learning Representations, San Diego, 2015

  24. arXiv:1406.5298  [pdf, other]

    cs.LG stat.ML

    Semi-Supervised Learning with Deep Generative Models

    Authors: Diederik P. Kingma, Danilo J. Rezende, Shakir Mohamed, Max Welling

    Abstract: The ever-increasing size of modern data sets combined with the difficulty of obtaining label information has made semi-supervised learning one of the problems of significant practical importance in modern data analysis. We revisit the approach to semi-supervised learning with generative models and develop new models that allow for effective generalisation from small labelled data sets to large unl…

    Submitted 31 October, 2014; v1 submitted 20 June, 2014; originally announced June 2014.

    Comments: To appear in the proceedings of Neural Information Processing Systems (NIPS) 2014

  25. arXiv:1402.0480  [pdf, other]

    cs.LG stat.ML

    Efficient Gradient-Based Inference through Transformations between Bayes Nets and Neural Nets

    Authors: Diederik P. Kingma, Max Welling

    Abstract: Hierarchical Bayesian networks and neural networks with stochastic hidden units are commonly perceived as two separate types of models. We show that either of these types of models can often be transformed into an instance of the other, by switching between centered and differentiable non-centered parameterizations of the latent variables. The choice of parameterization greatly influences the effi…

    Submitted 22 January, 2015; v1 submitted 3 February, 2014; originally announced February 2014.

    Journal ref: Proceedings of The 31st International Conference on Machine Learning, pp. 1782-1790, 2014

  26. arXiv:1312.6114  [pdf, other]

    stat.ML cs.LG

    Auto-Encoding Variational Bayes

    Authors: Diederik P Kingma, Max Welling

    Abstract: How can we perform efficient inference and learning in directed probabilistic models, in the presence of continuous latent variables with intractable posterior distributions, and large datasets? We introduce a stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case. Our contributions…

    Submitted 10 December, 2022; v1 submitted 20 December, 2013; originally announced December 2013.

    Comments: Fixes a typo in the abstract, no other changes

  27. arXiv:1306.0733  [pdf, other]

    cs.LG stat.ML

    Fast Gradient-Based Inference with Continuous Latent Variable Models in Auxiliary Form

    Authors: Diederik P Kingma

    Abstract: We propose a technique for increasing the efficiency of gradient-based inference and learning in Bayesian networks with multiple layers of continuous latent variables. We show that, in many cases, it is possible to express such models in an auxiliary form, where continuous latent variables are conditionally deterministic given their parents and a set of independent auxiliary variables. Variables…

    Submitted 4 June, 2013; originally announced June 2013.