Representative Selection for Big Data via Sparse Graph and Geodesic Grassmann Manifold Distance

Dang, Chinh; Radha, Hayder

arXiv:1405.1681 (cs)

This paper has been withdrawn by Chinh Dang

[Submitted on 7 May 2014 (v1), last revised 11 Mar 2015 (this version, v2)]

Abstract: This paper addresses the problem of identifying a very small subset of data points within a much larger dataset (i.e., Big Data). The selected points must adequately represent and faithfully characterize the full dataset. This identification process is known as representative selection [19]. We propose a novel representative selection framework that begins by generating an ℓ1-norm sparse graph for the given dataset. The data is partitioned recursively into clusters by applying a spectral clustering algorithm to the generated sparse graph. We treat each cluster as one point on a Grassmann manifold and measure the geodesic distances among these points. The distances are then analyzed using a min-max algorithm [1] to extract an optimal subset of clusters. Finally, by considering a sparse subgraph of each selected cluster, we detect a representative using principal component centrality [11]. We refer to the proposed framework as the Sparse Graph and Grassmann Manifold (SGGM) based approach. To validate the SGGM framework, we apply it to the problem of video summarization, where only a few video frames, known as key frames, are selected from a much longer video sequence. A comparison of the results obtained by the proposed algorithm with ground truth agreed upon by multiple human judges, and with several state-of-the-art methods, indicates the viability of the SGGM framework.
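
The middle steps of the pipeline described in the abstract (clusters as points on a Grassmann manifold, geodesic distances, subset selection) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. The abstract does not say how a cluster is mapped to a subspace, so the sketch assumes a truncated SVD basis. The geodesic distance is taken to be the standard arc-length distance (the 2-norm of the principal angles). The selection step is a generic greedy farthest-point heuristic standing in for the min-max algorithm cited as [1], which may differ. All function names are hypothetical, and the sparse graph, spectral clustering, and centrality steps are not shown.

```python
import numpy as np


def orthonormal_basis(points, dim):
    """Assumed cluster-to-subspace map: the top-`dim` left singular
    vectors of a cluster whose data points are the columns of `points`."""
    u, _, _ = np.linalg.svd(points, full_matrices=False)
    return u[:, :dim]


def grassmann_geodesic(basis_a, basis_b):
    """Arc-length geodesic distance between two subspaces, given
    orthonormal bases: the 2-norm of the principal angles."""
    # Singular values of A^T B are the cosines of the principal angles.
    cosines = np.linalg.svd(basis_a.T @ basis_b, compute_uv=False)
    angles = np.arccos(np.clip(cosines, -1.0, 1.0))
    return float(np.linalg.norm(angles))


def greedy_max_min(dist, k):
    """Stand-in selection step (an assumption, not necessarily [1]):
    start from the most separated pair, then repeatedly add the index
    whose minimum distance to the chosen set is largest."""
    first, second = np.unravel_index(np.argmax(dist), dist.shape)
    chosen = [int(first), int(second)]
    while len(chosen) < k:
        min_to_chosen = dist[:, chosen].min(axis=1)
        min_to_chosen[chosen] = -1.0  # never re-pick a chosen index
        chosen.append(int(np.argmax(min_to_chosen)))
    return chosen[:k]


def pairwise_distances(bases):
    """Symmetric matrix of geodesic distances between all subspaces."""
    n = len(bases)
    dist = np.zeros((n, n))
    for i in range(n):
        for j in range(i + 1, n):
            dist[i, j] = dist[j, i] = grassmann_geodesic(bases[i], bases[j])
    return dist
```

For example, with three lines through the origin in the plane (the x-axis, the y-axis, and the diagonal), `pairwise_distances` gives π/2 between the two axes and π/4 between each axis and the diagonal, so `greedy_max_min(dist, 2)` keeps the two axes as the most mutually distant pair.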

Comments:	This paper has been withdrawn by the author due to lacking details
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1405.1681 [cs.CV]
	(or arXiv:1405.1681v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1405.1681

Submission history

From: Chinh Dang
[v1] Wed, 7 May 2014 17:57:25 UTC (331 KB)
[v2] Wed, 11 Mar 2015 13:57:52 UTC (1 KB) (withdrawn)
