research-article

Diffusion maps-based image clustering

Authors:

F. FotouhiAuthors Info & Claims

IWRIDL '06: Proceedings of the 2006 international workshop on Research issues in digital libraries

Article No.: 9, Pages 1 - 7

https://doi.org/10.1145/1364742.1364754

Published: 12 December 2006 Publication History

Abstract

In the clustering of large number of images using low-level features, one of the problems encountered is the high dimensional feature space. The high dimensionality of feature spaces leads to unnecessary cost in feature selection and also in the distance measurement during the clustering process. In this paper, we propose an approach to reduce the dimensionality of the feature space based on diffusion maps. In the proposed approach, each image is represented by a set of tiles. A visual keyword-image matrix is derived from classifying these tiles into a set of clusters and counting the occurrence of each cluster in each image of our database. The visual keyword-image matrix is similar to the term-document matrix in information retrieval. We use diffusion maps to reduce the dimensionality of visual keyword matrix. By reducing the dimensionality of the image representation, we can save computation cost significantly. We compare the performance between the proposed approach and the approach that uses the global MPEG-7 color descriptors. The results demonstrate the improvements.

References

[1]

Agrawal, R., Grosky, W. I., and Fotouhi, F. Image clustering using multimodal keywords. In First International Conference on Semantic and Digital Multimedia Technologies (Athens, Greece, Dec. 2006), 113--123.

Digital Library

[2]

Agrawal, R., Grosky, W. I., and Fotouhi, F. Image retrieval using multimodal keywords. In Second IEEE International Workshop on Multimedia Information Processing and Retrieval (San Diego, CA, Dec. 2006), 817--822.

Digital Library

[3]

Belkin, M. and Niyogi, P. Laplacian eigenmaps for dimensionality reduction and data representation. Neural Computation, 15, 6 (2003), 1373--1396

Digital Library

[4]

Bhattacharya, A., Ljosa, V., Pan, J., Verardo, R., Yang, H., Faloutsos, C., and Singh, A. K. Vivo: Visual vocabulary construction for mining biomedical images. In Proceedings of the Fifth IEEE International Conference on Data Mining (New Orleans, LA, USA, Nov. 2005), 50--57.

Digital Library

[5]

Carson, C., Belonge, S., Greenspan, H., and Malik, J. Blob-world: A system for region-based image indexing and retrieval. In Proceedings of the First International Conference on Visual Information Systems (Amsterdam, The Netherlands, June 1999), Lecture Notes in Computer Science, 1614, Springer, 1999, 509--516.

Digital Library

[6]

Coifman, R. R. and Lafon, S. Diffusion maps. Applied and Computational Harmonic Analysis, 12, 1 (July 2006), 5--30.

[7]

Cox, T. and Cox, M. Multidimensional Scaling. Chapman & Hall, London, UK, 1994.

[8]

Deerwester, A., Dumais, S. T., Landauer, T. K., Furnas, G. W., and Harshman, R. A. Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41, 6 (Oct. 2001), 391--407.

[9]

Draper, B. A., Baek, K., Barlett, M. S., and Beveridge, J. R. Recognizing faces with PCA and ICA. Computer Vision and Image Understanding, 91 (2003), 115--137.

Digital Library

[10]

Karypis, G. Cluto: A clustering toolkit, release 2.1.1. Technical Report 02-017, University of Minnesota, Department of Computer Science, 2003.

[11]

Kasutani, E. and Yamada, A. The MPEG-7 color layout descriptor: A compact image feature description for highspeed image/video segment retrieval. In Proceedings of ICIP (Thessaloniki, Greece, Oct. 2001), 674--677.

[12]

Kaufman, L. and Rousseeuw, P. J. Finding Groups in Data: An Introduction to Cluster Analysis. John Wiley & Sons, Indianapolis, Indiana, USA, 1990.

[13]

Lafon, S. and Lee A. B. Diffusion maps and coarse-graining: A unified framework for dimensionality reduction, graph partitioning, and data set parameterization. IEEE TPAMI, 28, 9 (Sept. 2006), 1393--1403.

Digital Library

[14]

Law, M., Topchy, A., and Jain. A. K. Multiobjective data clustering. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (Washington, D.C., USA, June 2004), 424--430.

Digital Library

[15]

Manjunath, B. S., Salembier, P., and Sikor, T. (Eds.), Introduction to MPEG-7 Multimedia Content Description Interface, John Wiley & Sons, Indianapolis, Indiana, 2002.

Digital Library

[16]

Meila, M. and Shi, J. A random walks view of spectral segmentation. In International Conference on AI and Statistics (Key West, FL, USA, Jan. 2001).

[17]

Milos, http://milos.isti.cnr.it

[18]

Roweis, S. and Saul, L. Nonlinear dimensionality reduction by locally linear embedding. Science, 290 (2000), 2323--2326.

[19]

Salton, G. and McGill, M. J. Introduction to modern retrieval. McGraw Hill Book Company, New York, NY, 1983.

Digital Library

[20]

Shi, J. and Malik, J. Normalized cuts and image segmentation. IEEE TPAMI, 22, 8 (2000), 888--905.

Digital Library

[21]

Smeulders, W. M., Worring, M., Santini, S., Gupta, A., and Jain, R. Content based retrieval at the end of the early years. IEEE TPAMI, 22, 12 (2000), 1349--1380.

Digital Library

[22]

Sreenath, D. V., Grosky, W. I., and Fotouhi, F. Using coherent semantic subpaths to derive emergent semantics. In Proceedings of the Eighth International Conference in Knowledge-Based Intelligent Information and Engineering Systems (Wellington, New Zealand, Sept. 2004), 173--179.

[23]

Tang, J., Hare, J. S., and Lewis, P. H. Image auto-annotation using a statistical model with salient regions. In IEEE International Conference on Multimedia & Expo (Toronto, CA, July 2006), 525--528.

[24]

Tenenbaum, J., de Silva, V., and Langford, J. A global geometric framework for nonlinear dimensionality reduction. Science, 290 (2000), 2319--2323.

[25]

Torralba, A., Murphy, K. P., Freeman, W. T., and Russell, B. C. LabelMe: A database and web-based tool for image annotation. MIT AI Lab, AIM-2005-025 Edition, 2005.

[26]

Turk, M. A. and Pentland, A. P. Eigenfaces for recognition. Journal of Cognitive Neuroscience, 3, 1 (1991), 71--96.

Digital Library

[27]

Vailaya, A., Figueiredo, M., Jain, A., and Zhang, H. Image classification for content-based indexing. IEEE TIP, 10, 1 (2001).

Digital Library

[28]

Van Rijsbergen, C. J., Robertson, S. E., and Porter, M. F. New models in probabilistic information retrieval. British Library Research and Development Report, 1980.

[29]

Weiss, Y. Segmentation using eigenvectors: a unifying view. In Proceedings off the IEEE International Conference on Computer Vision (Kerkyra, Greece, Sept. 1999), 975--982.

Digital Library

[30]

Zhao, R. and Grosky, W. I. Narrowing the semantic gap -- improved text-based web document retrieval using visual features, IEEE Transactions on Multimedia, 4, 2 (June 2002), 189--200.

Digital Library

Cited By

Yu J(2013)Cartoon features selection using Diffusion ScoreSignal Processing10.1016/j.sigpro.2012.06.01193:6(1510-1520)Online publication date: 1-Jun-2013
https://dl.acm.org/doi/10.1016/j.sigpro.2012.06.011

Index Terms

Diffusion maps-based image clustering
1. Software and its engineering
  1. Software notations and tools
    1. General programming languages
      1. Language features

Recommendations

Image Clustering Using Discriminant Image Features
FIT '13: Proceedings of the 2013 11th International Conference on Frontiers of Information Technology

Manifold learning based image clustering models are usually employed at local level to deal with images sampled from nonlinear manifold. Usually, gray level image features are used that are obtained by resizing original images through linear ...
Grassmannian Diffusion Maps--Based Dimension Reduction and Classification for High-Dimensional Data

This work introduces the Grassmannian diffusion maps (GDMaps), a novel nonlinear dimensionality reduction technique that defines the affinity between points through their representation as low-dimensional subspaces corresponding to points on the Grassmann ...
Image clustering with tensor representation
MULTIMEDIA '05: Proceedings of the 13th annual ACM international conference on Multimedia

We consider the problem of image representation and clustering. Traditionally, an n₁ x n₂ image is represented by a vector in the Euclidean space ℝ ^{n1 x n2}. Some learning algorithms are then applied to these vectors in such a high dimensional space for ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

IWRIDL '06: Proceedings of the 2006 international workshop on Research issues in digital libraries

December 2006

121 pages

ISBN:1595936084

DOI:10.1145/1364742

Copyright © 2006 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGIR: ACM Special Interest Group on Information Retrieval

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 December 2006

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Conference

IWRIDL06

Sponsor:

SIGIR

IWRIDL06: International Workshop on Research Issues in Digital Libraries 2006

December 12 - 15, 2006

Kolkata, India

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

1
Total Citations
View Citations
216
Total Downloads

Downloads (Last 12 months)7
Downloads (Last 6 weeks)0

Reflects downloads up to 03 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

Yu J(2013)Cartoon features selection using Diffusion ScoreSignal Processing10.1016/j.sigpro.2012.06.01193:6(1510-1520)Online publication date: 1-Jun-2013
https://dl.acm.org/doi/10.1016/j.sigpro.2012.06.011

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Figures

Tables

Media

View Table of Conten