Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1364742.1364754acmconferencesArticle/Chapter ViewAbstractPublication PagesiwridlConference Proceedingsconference-collections
research-article

Diffusion maps-based image clustering

Published: 12 December 2006 Publication History

Abstract

In the clustering of large number of images using low-level features, one of the problems encountered is the high dimensional feature space. The high dimensionality of feature spaces leads to unnecessary cost in feature selection and also in the distance measurement during the clustering process. In this paper, we propose an approach to reduce the dimensionality of the feature space based on diffusion maps. In the proposed approach, each image is represented by a set of tiles. A visual keyword-image matrix is derived from classifying these tiles into a set of clusters and counting the occurrence of each cluster in each image of our database. The visual keyword-image matrix is similar to the term-document matrix in information retrieval. We use diffusion maps to reduce the dimensionality of visual keyword matrix. By reducing the dimensionality of the image representation, we can save computation cost significantly. We compare the performance between the proposed approach and the approach that uses the global MPEG-7 color descriptors. The results demonstrate the improvements.

References

[1]
Agrawal, R., Grosky, W. I., and Fotouhi, F. Image clustering using multimodal keywords. In First International Conference on Semantic and Digital Multimedia Technologies (Athens, Greece, Dec. 2006), 113--123.
[2]
Agrawal, R., Grosky, W. I., and Fotouhi, F. Image retrieval using multimodal keywords. In Second IEEE International Workshop on Multimedia Information Processing and Retrieval (San Diego, CA, Dec. 2006), 817--822.
[3]
Belkin, M. and Niyogi, P. Laplacian eigenmaps for dimensionality reduction and data representation. Neural Computation, 15, 6 (2003), 1373--1396
[4]
Bhattacharya, A., Ljosa, V., Pan, J., Verardo, R., Yang, H., Faloutsos, C., and Singh, A. K. Vivo: Visual vocabulary construction for mining biomedical images. In Proceedings of the Fifth IEEE International Conference on Data Mining (New Orleans, LA, USA, Nov. 2005), 50--57.
[5]
Carson, C., Belonge, S., Greenspan, H., and Malik, J. Blob-world: A system for region-based image indexing and retrieval. In Proceedings of the First International Conference on Visual Information Systems (Amsterdam, The Netherlands, June 1999), Lecture Notes in Computer Science, 1614, Springer, 1999, 509--516.
[6]
Coifman, R. R. and Lafon, S. Diffusion maps. Applied and Computational Harmonic Analysis, 12, 1 (July 2006), 5--30.
[7]
Cox, T. and Cox, M. Multidimensional Scaling. Chapman & Hall, London, UK, 1994.
[8]
Deerwester, A., Dumais, S. T., Landauer, T. K., Furnas, G. W., and Harshman, R. A. Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41, 6 (Oct. 2001), 391--407.
[9]
Draper, B. A., Baek, K., Barlett, M. S., and Beveridge, J. R. Recognizing faces with PCA and ICA. Computer Vision and Image Understanding, 91 (2003), 115--137.
[10]
Karypis, G. Cluto: A clustering toolkit, release 2.1.1. Technical Report 02-017, University of Minnesota, Department of Computer Science, 2003.
[11]
Kasutani, E. and Yamada, A. The MPEG-7 color layout descriptor: A compact image feature description for highspeed image/video segment retrieval. In Proceedings of ICIP (Thessaloniki, Greece, Oct. 2001), 674--677.
[12]
Kaufman, L. and Rousseeuw, P. J. Finding Groups in Data: An Introduction to Cluster Analysis. John Wiley & Sons, Indianapolis, Indiana, USA, 1990.
[13]
Lafon, S. and Lee A. B. Diffusion maps and coarse-graining: A unified framework for dimensionality reduction, graph partitioning, and data set parameterization. IEEE TPAMI, 28, 9 (Sept. 2006), 1393--1403.
[14]
Law, M., Topchy, A., and Jain. A. K. Multiobjective data clustering. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (Washington, D.C., USA, June 2004), 424--430.
[15]
Manjunath, B. S., Salembier, P., and Sikor, T. (Eds.), Introduction to MPEG-7 Multimedia Content Description Interface, John Wiley & Sons, Indianapolis, Indiana, 2002.
[16]
Meila, M. and Shi, J. A random walks view of spectral segmentation. In International Conference on AI and Statistics (Key West, FL, USA, Jan. 2001).
[17]
Milos, http://milos.isti.cnr.it
[18]
Roweis, S. and Saul, L. Nonlinear dimensionality reduction by locally linear embedding. Science, 290 (2000), 2323--2326.
[19]
Salton, G. and McGill, M. J. Introduction to modern retrieval. McGraw Hill Book Company, New York, NY, 1983.
[20]
Shi, J. and Malik, J. Normalized cuts and image segmentation. IEEE TPAMI, 22, 8 (2000), 888--905.
[21]
Smeulders, W. M., Worring, M., Santini, S., Gupta, A., and Jain, R. Content based retrieval at the end of the early years. IEEE TPAMI, 22, 12 (2000), 1349--1380.
[22]
Sreenath, D. V., Grosky, W. I., and Fotouhi, F. Using coherent semantic subpaths to derive emergent semantics. In Proceedings of the Eighth International Conference in Knowledge-Based Intelligent Information and Engineering Systems (Wellington, New Zealand, Sept. 2004), 173--179.
[23]
Tang, J., Hare, J. S., and Lewis, P. H. Image auto-annotation using a statistical model with salient regions. In IEEE International Conference on Multimedia & Expo (Toronto, CA, July 2006), 525--528.
[24]
Tenenbaum, J., de Silva, V., and Langford, J. A global geometric framework for nonlinear dimensionality reduction. Science, 290 (2000), 2319--2323.
[25]
Torralba, A., Murphy, K. P., Freeman, W. T., and Russell, B. C. LabelMe: A database and web-based tool for image annotation. MIT AI Lab, AIM-2005-025 Edition, 2005.
[26]
Turk, M. A. and Pentland, A. P. Eigenfaces for recognition. Journal of Cognitive Neuroscience, 3, 1 (1991), 71--96.
[27]
Vailaya, A., Figueiredo, M., Jain, A., and Zhang, H. Image classification for content-based indexing. IEEE TIP, 10, 1 (2001).
[28]
Van Rijsbergen, C. J., Robertson, S. E., and Porter, M. F. New models in probabilistic information retrieval. British Library Research and Development Report, 1980.
[29]
Weiss, Y. Segmentation using eigenvectors: a unifying view. In Proceedings off the IEEE International Conference on Computer Vision (Kerkyra, Greece, Sept. 1999), 975--982.
[30]
Zhao, R. and Grosky, W. I. Narrowing the semantic gap -- improved text-based web document retrieval using visual features, IEEE Transactions on Multimedia, 4, 2 (June 2002), 189--200.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
IWRIDL '06: Proceedings of the 2006 international workshop on Research issues in digital libraries
December 2006
121 pages
ISBN:1595936084
DOI:10.1145/1364742
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 December 2006

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. diffusion maps
  2. dimensional reduction
  3. image clustering

Qualifiers

  • Research-article

Conference

IWRIDL06
Sponsor:

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)7
  • Downloads (Last 6 weeks)0
Reflects downloads up to 03 Feb 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media