Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
10.1145/1081870.1081948acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
Article

A fast kernel-based multilevel algorithm for graph clustering

Published: 21 August 2005 Publication History

Abstract

Graph clustering (also called graph partitioning) --- clustering the nodes of a graph --- is an important problem in diverse data mining applications. Traditional approaches involve optimization of graph clustering objectives such as normalized cut or ratio association; spectral methods are widely used for these objectives, but they require eigenvector computation which can be slow. Recently, graph clustering with a general cut objective has been shown to be mathematically equivalent to an appropriate weighted kernel k-means objective function. In this paper, we exploit this equivalence to develop a very fast multilevel algorithm for graph clustering. Multilevel approaches involve coarsening, initial partitioning and refinement phases, all of which may be specialized to different graph clustering objectives. Unlike existing multilevel clustering approaches, such as METIS, our algorithm does not constrain the cluster sizes to be nearly equal. Our approach gives a theoretical guarantee that the refinement step decreases the graph cut objective under consideration. Experiments show that we achieve better final objective function values as compared to a state-of-the-art spectral clustering algorithm: on a series of benchmark test graphs with up to thirty thousand nodes and one million edges, our algorithm achieves lower normalized cut values in 67% of our experiments and higher ratio association values in 100% of our experiments. Furthermore, on large graphs, our algorithm is significantly faster than spectral methods. Finally, our algorithm requires far less memory than spectral methods; we cluster a 1.2 million node movie network into 5000 clusters, which due to memory requirements cannot be done directly with spectral methods.

References

[1]
T. Bui and C. Jones. A heuristic for reducing fill-in in sparse matrix factorization. In 6th SIAM Conference on Parallel Processing for Scientific Computing, pages 445--452, 1993.
[2]
P. Chan, M. Schlag, and J. Zien. Spectral k-way ratio cut partitioning. IEEE Trans. CAD-Integrated Circuits and Systems, 13:1088--1096, 1994.
[3]
I. Dhillon, Y. Guan, and B. Kulis. Kernel k-means, spectral clustering and normalized cuts. In Proc. 10th ACM KDD Conference, 2004.
[4]
I. Dhillon, Y. Guan, and B. Kulis. A unified view of kernel k-means, spectral clustering and graph cuts. Technical Report TR-04-25, University of Texas at Austin, 2004.
[5]
I. S. Dhillon, Y. Guan, and J. Kogan. Iterative clustering of high dimensional text data augmented by local search. In Proceedings of The 2002 IEEE International Conference on Data Mining, pages 131--138, 2002.
[6]
W. E. Donath and A. J. Hoffman. Lower bounds for the partitioning of graphs. IBM J. Res. Development, 17:422--425, 1973.
[7]
B. Hendrickson and R. Leland. A multilevel algorithm for partitioning graphs. Technical Report SAND93-1301, Sandia National Laboratories, 1993.
[8]
G. Karypis and V. Kumar. A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM J. Sci. Comput., 20(1):359--392, 1999.
[9]
J. Shi and J. Malik. Normalized cuts and image segmentation. IEEE Trans. Pattern Analysis and Machine Intelligence, 22(8):888--905, August 2000.
[10]
S. X. Yu and J. Shi. Multiclass spectral clustering. In International Conference on Computer Vision, 2003.

Cited By

View all
  • (2024)Research on Feature Extraction and Center Localization Recognition of Circular Hole Based on Gaussian Pyramid Hierarchical Clustering and Hough Circle Detection2024 IEEE 2nd International Conference on Image Processing and Computer Applications (ICIPCA)10.1109/ICIPCA61593.2024.10708992(626-630)Online publication date: 28-Jun-2024
  • (2023)Exploring Clustering Techniques for Analyzing User Engagement Patterns in Twitter DataComputers10.3390/computers1206012412:6(124)Online publication date: 19-Jun-2023
  • (2023)GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA)10.1109/HPCA56546.2023.10070983(42-55)Online publication date: Feb-2023
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '05: Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
August 2005
844 pages
ISBN:159593135X
DOI:10.1145/1081870
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 21 August 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. graph clustering
  2. kernel methods
  3. multilevel methods
  4. spectral clustering

Qualifiers

  • Article

Conference

KDD05

Acceptance Rates

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Upcoming Conference

KDD '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)22
  • Downloads (Last 6 weeks)4
Reflects downloads up to 31 Dec 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Research on Feature Extraction and Center Localization Recognition of Circular Hole Based on Gaussian Pyramid Hierarchical Clustering and Hough Circle Detection2024 IEEE 2nd International Conference on Image Processing and Computer Applications (ICIPCA)10.1109/ICIPCA61593.2024.10708992(626-630)Online publication date: 28-Jun-2024
  • (2023)Exploring Clustering Techniques for Analyzing User Engagement Patterns in Twitter DataComputers10.3390/computers1206012412:6(124)Online publication date: 19-Jun-2023
  • (2023)GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks2023 IEEE International Symposium on High-Performance Computer Architecture (HPCA)10.1109/HPCA56546.2023.10070983(42-55)Online publication date: Feb-2023
  • (2022)A Graph Mining Method for Characterizing and Measuring User Engagement in Twitter2022 17th International Workshop on Semantic and Social Media Adaptation & Personalization (SMAP)10.1109/SMAP56125.2022.9942038(1-6)Online publication date: 3-Nov-2022
  • (2022)Graph partitioning and visualization in graph mining: a surveyMultimedia Tools and Applications10.1007/s11042-022-13017-581:30(43315-43356)Online publication date: 23-May-2022
  • (2022)Subgraph mining in a large graph: A reviewWIREs Data Mining and Knowledge Discovery10.1002/widm.145412:4Online publication date: 8-Mar-2022
  • (2021)Shard schedulerProceedings of the 3rd ACM Conference on Advances in Financial Technologies10.1145/3479722.3480989(43-56)Online publication date: 26-Sep-2021
  • (2021)A Layout-Based Classification Method for Visualizing Time-Varying GraphsACM Transactions on Knowledge Discovery from Data10.1145/344130115:4(1-24)Online publication date: 26-Mar-2021
  • (2021)Multiway p-spectral graph cuts on Grassmann manifoldsMachine Learning10.1007/s10994-021-06108-1111:2(791-829)Online publication date: 18-Nov-2021
  • (2020)AML-SVM: Adaptive Multilevel Learning with Support Vector Machines2020 IEEE International Conference on Big Data (Big Data)10.1109/BigData50022.2020.9378010(788-797)Online publication date: 10-Dec-2020
  • Show More Cited By

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media