Unsupervised Learning of GMM with a Uniform Background Component

Liu, Sida; Barbu, Adrian

Statistics > Machine Learning

arXiv:1804.02744 (stat)

[Submitted on 8 Apr 2018 (v1), last revised 20 Mar 2020 (this version, v4)]

Title:Unsupervised Learning of GMM with a Uniform Background Component

Authors:Sida Liu, Adrian Barbu

View PDF

Abstract:Gaussian Mixture Models are one of the most studied and mature models in unsupervised learning. However, outliers are often present in the data and could influence the cluster estimation. In this paper, we study a new model that assumes that data comes from a mixture of a number of Gaussians as well as a uniform ``background'' component assumed to contain outliers and other non-interesting observations. We develop a novel method based on robust loss minimization that performs well in clustering such GMM with a uniform background. We give theoretical guarantees for our clustering algorithm to obtain best clustering results with high probability. Besides, we show that the result of our algorithm does not depend on initialization or local optima, and the parameter tuning is an easy task. By numeric simulations, we demonstrate that our algorithm enjoys high accuracy and achieves the best clustering results given a large enough sample size. Finally, experimental comparisons with typical clustering methods on real datasets witness the potential of our algorithm in real applications.

Comments:	36 pages, 16 figures and 4 tables
Subjects:	Machine Learning (stat.ML); Machine Learning (cs.LG)
Cite as:	arXiv:1804.02744 [stat.ML]
	(or arXiv:1804.02744v4 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1804.02744

Submission history

From: Sida Liu [view email]
[v1] Sun, 8 Apr 2018 19:27:39 UTC (3,950 KB)
[v2] Sun, 27 May 2018 14:42:25 UTC (4,498 KB)
[v3] Thu, 6 Dec 2018 00:19:16 UTC (8,845 KB)
[v4] Fri, 20 Mar 2020 19:15:55 UTC (8,785 KB)

Statistics > Machine Learning

Title:Unsupervised Learning of GMM with a Uniform Background Component

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Statistics > Machine Learning

Title:Unsupervised Learning of GMM with a Uniform Background Component

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators