Sign Stable Projections, Sign Cauchy Projections and Chi-Square Kernels

Li, Ping; Samorodnitsky, Gennady; Hopcroft, John

Computer Science > Machine Learning

arXiv:1308.1009 (cs)

[Submitted on 5 Aug 2013]

Title:Sign Stable Projections, Sign Cauchy Projections and Chi-Square Kernels

Authors:Ping Li, Gennady Samorodnitsky, John Hopcroft

View PDF

Abstract:The method of stable random projections is popular for efficiently computing the Lp distances in high dimension (where 0<p<=2), using small space. Because it adopts nonadaptive linear projections, this method is naturally suitable when the data are collected in a dynamic streaming fashion (i.e., turnstile data streams). In this paper, we propose to use only the signs of the projected data and analyze the probability of collision (i.e., when the two signs differ). We derive a bound of the collision probability which is exact when p=2 and becomes less sharp when p moves away from 2. Interestingly, when p=1 (i.e., Cauchy random projections), we show that the probability of collision can be accurately approximated as functions of the chi-square similarity. For example, when the (un-normalized) data are binary, the maximum approximation error of the collision probability is smaller than 0.0192. In text and vision applications, the chi-square similarity is a popular measure for nonnegative data when the features are generated from histograms. Our experiments confirm that the proposed method is promising for large-scale learning applications.

Subjects:	Machine Learning (cs.LG); Data Structures and Algorithms (cs.DS); Information Retrieval (cs.IR)
Cite as:	arXiv:1308.1009 [cs.LG]
	(or arXiv:1308.1009v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1308.1009

Submission history

From: Ping Li [view email]
[v1] Mon, 5 Aug 2013 15:25:51 UTC (143 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2013-08

Change to browse by:

cs
cs.DS
cs.IR

References & Citations

DBLP - CS Bibliography

listing | bibtex

Ping Li
Gennady Samorodnitsky
John E. Hopcroft

export BibTeX citation

Computer Science > Machine Learning

Title:Sign Stable Projections, Sign Cauchy Projections and Chi-Square Kernels

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Sign Stable Projections, Sign Cauchy Projections and Chi-Square Kernels

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators