Learning Interpretable and Discrete Representations with Adversarial Training for Unsupervised Text Classification

Wang, Yau-Shian; Lee, Hung-Yi; Chen, Yun-Nung

Computer Science > Computation and Language

arXiv:2004.13255 (cs)

[Submitted on 28 Apr 2020]

Title:Learning Interpretable and Discrete Representations with Adversarial Training for Unsupervised Text Classification

Authors:Yau-Shian Wang, Hung-Yi Lee, Yun-Nung Chen

View PDF

Abstract:Learning continuous representations from unlabeled textual data has been increasingly studied for benefiting semi-supervised learning. Although it is relatively easier to interpret discrete representations, due to the difficulty of training, learning discrete representations for unlabeled textual data has not been widely explored. This work proposes TIGAN that learns to encode texts into two disentangled representations, including a discrete code and a continuous noise, where the discrete code represents interpretable topics, and the noise controls the variance within the topics. The discrete code learned by TIGAN can be used for unsupervised text classification. Compared to other unsupervised baselines, the proposed TIGAN achieves superior performance on six different corpora. Also, the performance is on par with a recently proposed weakly-supervised text classification method. The extracted topical words for representing latent topics show that TIGAN learns coherent and highly interpretable topics.

Comments:	14 pages
Subjects:	Computation and Language (cs.CL); Information Retrieval (cs.IR); Machine Learning (cs.LG)
Cite as:	arXiv:2004.13255 [cs.CL]
	(or arXiv:2004.13255v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2004.13255

Submission history

From: Yaushian Wang [view email]
[v1] Tue, 28 Apr 2020 02:53:59 UTC (144 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 2020-04

Change to browse by:

cs
cs.IR
cs.LG

References & Citations

DBLP - CS Bibliography

listing | bibtex

Yau-Shian Wang
Hung-yi Lee
Yun-Nung Chen

export BibTeX citation

Computer Science > Computation and Language

Title:Learning Interpretable and Discrete Representations with Adversarial Training for Unsupervised Text Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Learning Interpretable and Discrete Representations with Adversarial Training for Unsupervised Text Classification

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators