Sparse Kernel PCA for Outlier Detection

Das, Rudrajit; Golatkar, Aditya; Awate, Suyash P.

arXiv:1809.02497 (cs)

[Submitted on 7 Sep 2018 (v1), last revised 13 Sep 2018 (this version, v2)]

Abstract: We propose a new method for Sparse Kernel Principal Component Analysis (SKPCA) and mathematically analyze the validity of SKPCA. We formulate SKPCA as a constrained optimization problem with elastic net regularization (Hastie et al.) in kernel feature space and solve it. As an application of SKPCA, we consider outlier detection, a task in which KPCA is employed, using the RBF kernel. We test the method on 5 real-world datasets and show that, using just 4% (or even less) of the principal components (PCs), where each PC has on average less than 12% non-zero elements in the worst case among all 5 datasets, we nearly match KPCA and on 3 datasets even outperform it. We also compare our method with a recently proposed SKPCA method by Wang et al. and show that ours performs better in terms of both accuracy and sparsity. Finally, we provide a novel probabilistic proof to justify the existence of sparse solutions for KPCA using the RBF kernel. To the best of our knowledge, this is the first attempt at theoretically analyzing the validity of SKPCA.
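
To make the application concrete, here is a minimal sketch of outlier scoring with ordinary (dense) KPCA and the RBF kernel. It is not the authors' sparse method, whose formulation is not given in the abstract. It assumes that a point is scored by its reconstruction error in kernel feature space, which is one common choice and may differ from the paper's score. The function names, the kernel width `gamma`, the number of components `q`, and the toy data are all hypothetical. The eigenvectors are found by plain power iteration so that the sketch needs only the standard library.

```python
import math
import random


def rbf(x, y, gamma):
    """RBF kernel k(x, y) = exp(-gamma * ||x - y||^2)."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, y)))


def top_eigenpairs(M, q, iters=300, seed=0):
    """Approximate top-q eigenpairs of a symmetric PSD matrix
    using power iteration with deflation."""
    rng = random.Random(seed)
    n = len(M)
    M = [row[:] for row in M]  # work on a copy; deflation modifies it
    pairs = []
    for _ in range(q):
        v = [rng.gauss(0, 1) for _ in range(n)]
        lam = 0.0
        for _ in range(iters):
            w = [sum(M[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:
                break
            v = [x / norm for x in w]
            lam = norm  # for unit v, ||M v|| approaches the eigenvalue
        pairs.append((lam, v))
        for i in range(n):  # deflate: remove the component just found
            for j in range(n):
                M[i][j] -= lam * v[i] * v[j]
    return pairs


def fit_kpca(X, gamma, q):
    """Fit dense KPCA on training points X, keeping up to q components."""
    n = len(X)
    K = [[rbf(a, b, gamma) for b in X] for a in X]
    row_mean = [sum(r) / n for r in K]
    tot_mean = sum(row_mean) / n
    # Center the kernel matrix in feature space.
    Kc = [[K[i][j] - row_mean[i] - row_mean[j] + tot_mean
           for j in range(n)] for i in range(n)]
    # Scale each eigenvector so the feature-space direction has unit norm.
    alphas = [[x / math.sqrt(lam) for x in v]
              for lam, v in top_eigenpairs(Kc, q) if lam > 1e-10]
    return {"X": X, "gamma": gamma, "row_mean": row_mean,
            "tot_mean": tot_mean, "alphas": alphas}


def outlier_score(model, z):
    """Feature-space reconstruction error of z; larger means more outlying."""
    X, gamma = model["X"], model["gamma"]
    n = len(X)
    kz = [rbf(z, x, gamma) for x in X]
    kz_mean = sum(kz) / n
    kz_c = [kz[i] - kz_mean - model["row_mean"][i] + model["tot_mean"]
            for i in range(n)]
    # Squared norm of the centered feature vector; k(z, z) = 1 for RBF.
    spherical = 1.0 - 2.0 * kz_mean + model["tot_mean"]
    projected = sum(sum(a * k for a, k in zip(alpha, kz_c)) ** 2
                    for alpha in model["alphas"])
    return spherical - projected


# Toy data (hypothetical): a 2-D Gaussian blob of inliers.
rng = random.Random(0)
train = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(30)]
model = fit_kpca(train, gamma=0.5, q=3)

score_in = outlier_score(model, (0.0, 0.0))     # near the blob centre
score_far = outlier_score(model, (10.0, 10.0))  # far from all training data
print("inlier score:", score_in)
print("outlier score:", score_far)
```

In this dense baseline every component in `alphas` has one coefficient per training point, so scoring a new point requires a kernel evaluation against the whole training set. A sparse variant such as the one the abstract describes would leave most of those coefficients at zero.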

Comments:	Accepted at IEEE ICMLA 2018 for Oral Presentation
Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1809.02497 [cs.LG]
	(or arXiv:1809.02497v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1809.02497

Submission history

From: Rudrajit Das
[v1] Fri, 7 Sep 2018 14:23:03 UTC (331 KB)
[v2] Thu, 13 Sep 2018 18:35:15 UTC (482 KB)
