On Variational Learning of Controllable Representations for Text without Supervision

Xu, Peng; Cheung, Jackie Chi Kit; Cao, Yanshuai

Computer Science > Computation and Language

arXiv:1905.11975 (cs)

[Submitted on 28 May 2019 (v1), last revised 7 Aug 2020 (this version, v4)]

Title:On Variational Learning of Controllable Representations for Text without Supervision

Authors:Peng Xu, Jackie Chi Kit Cheung, Yanshuai Cao

View PDF

Abstract:The variational autoencoder (VAE) can learn the manifold of natural images on certain datasets, as evidenced by meaningful interpolating or extrapolating in the continuous latent space. However, on discrete data such as text, it is unclear if unsupervised learning can discover similar latent space that allows controllable manipulation. In this work, we find that sequence VAEs trained on text fail to properly decode when the latent codes are manipulated, because the modified codes often land in holes or vacant regions in the aggregated posterior latent space, where the decoding network fails to generalize. Both as a validation of the explanation and as a fix to the problem, we propose to constrain the posterior mean to a learned probability simplex, and performs manipulation within this simplex. Our proposed method mitigates the latent vacancy problem and achieves the first success in unsupervised learning of controllable representations for text. Empirically, our method outperforms unsupervised baselines and strong supervised approaches on text style transfer, and is capable of performing more flexible fine-grained control over text generation than existing methods.

Comments:	ICML 2020 Camera Ready. Previous title: Unsupervised Controllable Text Generation with Global Variation Discovery and Disentanglement
Subjects:	Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1905.11975 [cs.CL]
	(or arXiv:1905.11975v4 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1905.11975

Submission history

From: Peng Xu [view email]
[v1] Tue, 28 May 2019 17:49:47 UTC (365 KB)
[v2] Sat, 12 Oct 2019 02:47:53 UTC (617 KB)
[v3] Fri, 7 Feb 2020 21:42:52 UTC (1,086 KB)
[v4] Fri, 7 Aug 2020 17:44:10 UTC (1,094 KB)

Computer Science > Computation and Language

Title:On Variational Learning of Controllable Representations for Text without Supervision

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:On Variational Learning of Controllable Representations for Text without Supervision

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators