SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization

Takida, Yuhta; Shibuya, Takashi; Liao, WeiHsiang; Lai, Chieh-Hsin; Ohmura, Junki; Uesaka, Toshimitsu; Murata, Naoki; Takahashi, Shusuke; Kumakura, Toshiyuki; Mitsufuji, Yuki

Computer Science > Machine Learning

arXiv:2205.07547 (cs)

[Submitted on 16 May 2022 (v1), last revised 9 Jun 2022 (this version, v2)]

Title:SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization

Authors:Yuhta Takida, Takashi Shibuya, WeiHsiang Liao, Chieh-Hsin Lai, Junki Ohmura, Toshimitsu Uesaka, Naoki Murata, Shusuke Takahashi, Toshiyuki Kumakura, Yuki Mitsufuji

View PDF

Abstract:One noted issue of vector-quantized variational autoencoder (VQ-VAE) is that the learned discrete representation uses only a fraction of the full capacity of the codebook, also known as codebook collapse. We hypothesize that the training scheme of VQ-VAE, which involves some carefully designed heuristics, underlies this issue. In this paper, we propose a new training scheme that extends the standard VAE via novel stochastic dequantization and quantization, called stochastically quantized variational autoencoder (SQ-VAE). In SQ-VAE, we observe a trend that the quantization is stochastic at the initial stage of the training but gradually converges toward a deterministic quantization, which we call self-annealing. Our experiments show that SQ-VAE improves codebook utilization without using common heuristics. Furthermore, we empirically show that SQ-VAE is superior to VAE and VQ-VAE in vision- and speech-related tasks.

Comments:	25 pages with 10 figures, accepted for publication in ICML 2022 (Our code is available at this https URL)
Subjects:	Machine Learning (cs.LG); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2205.07547 [cs.LG]
	(or arXiv:2205.07547v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2205.07547

Submission history

From: Yuhta Takida [view email]
[v1] Mon, 16 May 2022 09:49:37 UTC (16,217 KB)
[v2] Thu, 9 Jun 2022 12:46:05 UTC (15,081 KB)

Computer Science > Machine Learning

Title:SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:SQ-VAE: Variational Bayes on Discrete Representation with Self-annealed Stochastic Quantization

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators