Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Chameleon algorithm based on mutual k-nearest neighbors

Published: 01 April 2021 Publication History

Abstract

Clustering is a typical unsupervised data analysis method, which divides a given data set without label information into multiple clusters. The data on each cluster has a great deal of association, which can be used as the preprocessing stage of other algorithms or for further association analysis. Therefore, clustering plays an important role in a wide range of fields. Chameleon is a clustering algorithm that combines the relative interconnectivity and relative closeness to find clusters of arbitrary shape with high quality. However, the graph-partitioning technology hMETIS algorithm used in the algorithm is difficult to operate and easy to cause uncertainty of results. In addition, the final number of clusters need to be specified by user as a parameter to stop merging, which is difficult to determine without prior information. Aiming at these shortcomings, Chameleon algorithm based on mutual k-nearest neighbors (MChameleon) is proposed. Firstly, the idea of mutual k-nearest neighbors is introduced to directly generate sub-clusters, which omits the process of partitioning graph. Then, the concept of MC modularity is introduced, which is used to objectively identify the final clustering results. By experiments on artificial data sets and UCI data sets, we compared MChameleon with the original Chameleon algorithm, the improved AChameleon algorithm and the classic K-Means, DBSCAN, BIRCH algorithm in accuracy. Experimental results on data sets show that Chameleon algorithm based on mutual k-nearest neighbors has great advantages and is feasible.

References

[1]
Zanin M, Papo D, Sousa PA, et al. Combining complex networks and data mining: why and how Phys Rep 2016 635 1-44
[2]
Du M, Ding S, and Jia H Study on density peaks clustering based on k-nearest neighbors and principal component analysis Knowl-Based Syst 2016 99 135-145
[3]
Nguyen HL, Woon YK, and Ng WK A survey on data stream clustering and classification Knowl Inf Syst 2015 45 3 535-569
[4]
Xu X, Ding S, Xu H, et al. A feasible density peaks clustering algorithm with a merging strategy Soft Comput 2018 23 13 5171-5183
[5]
Khanmohammadi S, Adibeig N, and Shanehbandy S An improved overlapping k-means clustering method for medical applications Expert Syst Appl 2017 67 12-18
[6]
Yu Z, Li L, Liu J, Zhang J, and Han G Adaptive noise immune cluster ensemble using affinity propagation IEEE Trans Knowl Data Eng 2015 27 12 3176-3189
[7]
Morris K and McNicholas PD Clustering, classification, discriminant analysis, and dimension reduction via generalized hyperbolic mixtures Comput Stat and Data An 2016 97 133-150
[8]
Huang D, Wang CD, and Lai JH Locally weighted ensemble clustering IEEE T Cybernetics 2018 48 5 1460-1473
[9]
Han J, Micheline K (2006) Data mining: concepts and techniques. data mining concepts models methods & algorithms second edition, 5(4), pp 1–18
[10]
Fan SY, Ding SF, and Xue Y Self-adaptive kernel K-means algorithm based on the shuffled frog leaping algorithm Soft Comput 2018 22 3 861-872
[11]
Galan SF Comparative evaluation of region query strategies for DBSCAN clustering Inf Sci 2019 502 76-90
[12]
Wu B and Wilamowski BM A fast density and grid based clustering method for data with arbitrary shapes and noise IEEE T Ind Inform 2017 13 4 1620-1628
[13]
Gorricha J and Lobo V Improvements on the visualization of clusters in geo-referenced data using self-organizing maps Comput Geosci 2012 43 177-186
[14]
Ros F and Guillaume S A hierarchical clustering algorithm and an improvement of the single linkage criterion to deal with noise Expert Syst Appl 2019 128 96-108
[15]
Bouguettaya A, Yu Q, Liu X, Zhou X, and Song A Efficient agglomerative hierarchical clustering Expert Syst Appl 2015 42 5 2785-2797
[16]
Karypis G, Han EH, and Kumar V Chameleon: hierarchical clustering using dynamic modeling Computer 1999 32 8 68-75
[17]
Xue W, Liu P, and Liu D Improved Chameleon algorithm using weighted nearest neighbors graph Journal of Computer Applications 2012 10 208-211
[18]
Karypis G, Aggarwal R, Kumar V, and Shekhar S Multilevel hypergraph partitioning: applications in VLSI domain IEEE T VLSI Syst 1999 7 1 69-79
[19]
Guo D, Zhao J, Liu J (2019) Research and application of improved CHAMELEON algorithm based on condensed hierarchical clustering method. In: Proceedings of the 2019 8th international conference on networks, communication and computing. Association for Computing Machinery, Luoyang, pp 14–18
[20]
Zhang W and Li J Extended fast search clustering algorithm: widely density clusters, no density peaks Comput SciInf Technol 2015 5 7 1-17
[21]
Barton T, Bruna T, and Kordik P Chameleon 2: an improved graph-based clustering algorithm ACM Trans Knowl Discov Data 2019 13 1 1-27
[22]
Wang L, Dai G, and Zhao H Research on modularity for evaluating community structure Comput Eng 2010 36 14 227-229
[23]
Garruzzo S and Rosaci D Agent clustering based on semantic negotiation ACM T Auton Adap Sys 2008 3 2 1-40
[24]
Fan J, Jia P, and Ge L Mk-NNG-DPC: density peaks clustering based on improved mutual K-nearest-neighbor graph Int J Mach Learn Cybern 2019 11 6 1179-1195
[25]
Liu H and Zhang S Noisy data elimination using mutual k-nearest neighbor for classification mining J Syst Softw 2012 84 5 1067-1074
[26]
Newman M and Girvan M Finding and evaluating community structure in networks Phys Rev E 2004 69 026113
[27]
Blondel VD, Guillaume JL, Lambiotte R et al (2008) Fast unfolding of communities in large networks. J Stat Mech-Theory E:10008
[28]
Kong B, Zhou L, Liu W (2012) Improved modularity based on Girvan-Newman modularity. In: 2012 second international conference on intelligent system design and Engineering application. IEEE, Sanya, pp 293–296
[29]
Xu X, Ding S, and Shi Z An improved density peaks clustering algorithm with fast finding cluster centers Knowl-Based Syst 2018 158 65-74
[30]
Xu TS, Chiang HD, Liu GY, and Tan CW Hierarchical K-means method for clustering large-scale advanced metering infrastructure data IEEE TPower Deliver 2017 32 2 609-616
[31]
Madan S and Dana KJ Modified balanced iterative reducing and clustering using hierarchies (m-BIRCH) for visual clustering Pattern Anal Appl 2016 19 4 1023-1040

Cited By

View all
  • (2024)Learning using granularity statistical invariants for classificationApplied Intelligence10.1007/s10489-024-05506-x54:8(6667-6681)Online publication date: 1-Apr-2024
  • (2024)A dynamic density-based clustering method based on K-nearest neighborKnowledge and Information Systems10.1007/s10115-023-02038-766:5(3005-3031)Online publication date: 1-May-2024
  • (2022)An efficient DBSCAN optimized by arithmetic optimization algorithm with opposition-based learningThe Journal of Supercomputing10.1007/s11227-022-04634-w78:18(19566-19604)Online publication date: 1-Dec-2022

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Applied Intelligence
Applied Intelligence  Volume 51, Issue 4
Apr 2021
874 pages

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 01 April 2021
Accepted: 02 September 2020

Author Tags

  1. Clustering analysis
  2. Mutual k-nearest neighbors
  3. Chameleon algorithm
  4. Modularity

Qualifiers

  • Research-article

Funding Sources

  • the National Natural Science Foundations of China

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 01 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)Learning using granularity statistical invariants for classificationApplied Intelligence10.1007/s10489-024-05506-x54:8(6667-6681)Online publication date: 1-Apr-2024
  • (2024)A dynamic density-based clustering method based on K-nearest neighborKnowledge and Information Systems10.1007/s10115-023-02038-766:5(3005-3031)Online publication date: 1-May-2024
  • (2022)An efficient DBSCAN optimized by arithmetic optimization algorithm with opposition-based learningThe Journal of Supercomputing10.1007/s11227-022-04634-w78:18(19566-19604)Online publication date: 1-Dec-2022

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media