Location via proxy:   [ UP ]  
[Report a bug]   [Manage cookies]                
skip to main content
research-article

Multi-level clustering based on cluster order constructed with dynamic local density

Published: 12 August 2022 Publication History

Abstract

Density-based clustering has gained increasing attention during the past decades as it allows the discovery of clusters with arbitrary shapes and is robust to noisy objects. However, existing density-based clustering approaches tend to fail if there exist multiple clusters with different densities in a sea of noise. In this paper, we propose a new multi-level clustering method by exploiting the dynamic local density with wavelet transform. Specifically, a concept of dynamic reverse k-nearest neighbor is first introduced, and its count distribution is modeled as a Poisson distribution. The dynamic local density, which is robust to density varieties, is further defined with the cumulative Poisson distribution function. Afterward, a cluster order is constructed based on the derived dynamic local density and finally used to yield the clusters by employing the wavelet transform. Compared to existing approaches, our proposed method can detect clusters with different densities and allows obtaining more clustering information such as the number of clusters, break points between clusters, the boundary of clusters, etc. Extensive experiments on both synthetic and real-world data sets have demonstrated that our proposed method is effective and produces better clustering results when compared to many state-of-the-art clustering algorithms.

References

[1]
Saxena A, Prasad M, Gupta A, Bharill N, Patel OP, Tiwari A, Er MJ, Ding W, and Lin C-T A review of clustering techniques and developments Neurocomputing 2017 267 664-681
[2]
Campello R J G B, Kroeger P, Sander J, Zimek A (2020) Density-based clustering. Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery, 10
[3]
Tang D, Zhang S, Chen J, and Wang X The detection of low-rate dos attacks using the sadbscan algorithm Inf Sci 2021 565 229-247
[4]
Xie X, Duan L, Qiu T, and Li J Quantum algorithm for mmng-based dbscan Sci rep 2021 11 15559
[5]
Rad MH and Abdolrazzagh-Nezhad M A new hybridization of dbscan and fuzzy earthworm optimization algorithm for data cube clustering Soft Comput 2020 24 15529-15549
[6]
Lv Y, Ma T, Tang M, Cao J, Tian Y, AL-Dhelaan A, and Al-Rodhaan M An efficient and scalable density-based clustering algorithm for datasets with complex structures Neurocomputing 2016 171 9-22
[7]
Bryant A and Cios K Rnn-dbscan: A density-based clustering algorithm using reverse nearest neighbor density estimates IEEE Trans Knowl Data Eng 2018 30 1109-1121
[8]
Lu J and Zhu Q An effective algorithm based on density clustering framework IEEE Access 2017 5 4991-5000
[9]
Cassisi C, Ferro A, Giugno R, Pigola G, and Pulvirenti A Enhancing density-based clustering: Parameter reduction and outlier detection Inf Syst 2013 38 3 317-330
[10]
Zhu Q, Feng J, and Huang J Natural neighbor: A self-adaptive neighborhood method without parameter k Pattern Recogn Lett 2016 80 30-36
[11]
Xie H, Li P (2021) A density-based evolutionary clustering algorithm for intelligent development. Eng Appl Artif Intell, 104
[12]
Rodriguez A and Laio A Clustering by fast search and find of density peaks Science 2014 344 1492-1496
[13]
Du M, Ding S, and Jia H Study on density peaks clustering based on k-nearest neighbors and principal component analysis Knowl-Based Syst 2016 99 135-145
[14]
Liu R, Wang H, and Yu X Shared-nearest-neighbor-based clustering by fast search and find of density peaks Inf Sci 2018 450 200-226
[15]
Ni L, Luo W, Zhu W, and Liu W Clustering by finding prominent peaks in density space Eng Appl Artif Intell 2019 85 727-739
[16]
Lu J, Zhu Q, and Wu Q A novel data clustering algorithm using heuristic rules based on k-nearest neighbors chain Eng Appl Artif Intell 2018 72 213-227
[17]
Hou J, Zhang A, Qi N (2020) Density peak clustering based on relative density relationship. Pattern Recogn, 108
[18]
Wang S, Li Q, Zhao C, Zhu X, Yuan H, and Dai T Extreme clustering - a clustering method via density extreme points Inf Sci 2021 542 24-39
[19]
Vadapalli S, Valluri S R, Karlapalem K (2006) A simple yet effective data clustering algorithm. In: IEEE International conference on data mining, p 1108
[20]
Ren Y, Wang N, Li M, and Xu Z Deep density-based image clustering Knowl-Based Syst 2020 197 1-8
[21]
Guo S, Xu J, Chen D, Zhang C, Wang X, Zhao R (2020) Density-aware feature embedding for face clustering. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 6697– 6705
[22]
Qi C, Zhang J, Jia H, Mao Q, Wang L, and Song H Deep face clustering using residual graph convolutional network Knowl-Based Syst 2021 211 2854-2871
[23]
Chen R, Tang Y, Tian L, Zhang C, and Zhang W Deep convolutional self-paced clustering Appl Intell 2022 52 5 4858-4872
[24]
Inouye D, Yang E, Allen G, Ravikumar P (2017) A review of multivariate distributions for count data derived from the poisson distribution. Wiley interdisciplinary reviews Computational statistics, 9
[25]
Vagner A The gridoptics clustering algorithm Intell Data Anal 2016 20 1061-1084
[26]
Agrawal KP, Garg S, Sharma S, and Patel P Development and validation of optics based spatio-temporal clustering technique Inf Sci 2016 369 388-401
[27]
Zhu X, Nie S, Wang C, Xi X, Wang J, Li D, and Zhou H A noise removal algorithm based on optics for photon-counting lidar data IEEE Geosci Remote Sens Lett 2021 8 1471-1475
[28]
Franti P and Sieranoja S K-means properties on six clustering benchmark datasets Appl Intell 2018 48 12 4743-4759
[29]
Amin H U, Yusoff M Z, Ahmad R F (2020) A novel approach based on wavelet analysis and arithmetic coding for automated detection and diagnosis of epileptic seizure in eeg signals using machine learning techniques. Biomedical Signal Processing and Control, 56
[30]
Dogan S and Tuncer T A novel statistical decimal pattern-based surface electromyogram signal classification method using tunableq-factor wavelet transform Soft Comput 2021 25 1085-1098
[31]
Suzuki T Wavelet-based spectral-spatial transforms for cfa-sampled raw camera image compression IEEE Trans Image Process 2020 29 433-444
[32]
Ma H, Liu D, Xiong R, and Wu F iwave: Cnn-based wavelet-like transform for image compression IEEE Trans Multimed 2020 22 7 1667-1679
[33]
Daubechies I The wavelet transform, time-frequency localization and signal analysis IEEE Trans Inf Theory 1990 36 961-1005
[34]
Shyh-Jier H and Cheng-Tao H Coiflet wavelet transform applied to inspect power system disturbance-generated signals IEEE Trans Aerosp Electron Syst 2002 38 1
[35]
Bergil E, Oral C, and Ergul EU Efficient hand movement detection using k-means clustering and k-nearest neighbor algorithms J Med Biol Eng 2021 41 1 11-24
[36]
Jafari MM, Ojaghlou H, Zare M, and Schumann GJ-P Application of a novel hybrid wavelet-anfis/fuzzy c-means clustering model to predict groundwater fluctuations Atmosphere 2021 12 1
[37]
Kulagin V P, Akimov D A, Pavelyev S A, Potapov D A (2019) Automated identification of critical malfunctions of aircraft engines based on modified wavelet transform and deep neural network clustering. In: 2019 Workshop on materials and engineering in aeronautics (MEA), vol 714
[38]
Sheikholeslami G, Chatterjee S, and Zhang AD Wavecluster: a wavelet-based clustering approach for spatial data in very large databases VLDB J 2000 8 3-4 289-304
[39]
Xu Y, Zhu Q, Fan Z, Qiu M, Chen Y, and Liu H Coarse to fine k nearest neighbor classifier IEEE Trans Knowl Data Eng 2013 34 9 980-986
[40]
Ester M, Kriegel HP, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the second international conference on knowledge discovery and data mining (KDD-96), pp 226–231
[41]
Ankerst M, Breunig M M, Kriegel H P, Plant C, Sander J (1999) Optics: ordering points to identify the clustering structure. In: SIGMOD 1999, Proceedings ACM SIGMOD international conference on management of data
[42]
von Luxburg U A tutorial on spectral clustering Stat Comput 2007 17 4 395-416
[43]
Liu X, Cheng H-M, and Zhang Z-Y Evaluation of community detection methods IEEE Trans Knowl Data Eng 2020 32 9 1736-1746
[44]
Zhang S, Wong H-S, and Shen Y Generalized adjusted rand indices for cluster ensembles Pattern Recogn 2012 45 6 2214-2226
[45]
Dua D, Graff C (2019) Uci machine learning repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science
[46]
Sampat MP, Wang Z, Gupta S, Bovik AC, and Markey MK Complex wavelet structural similarity: a new image similarity index IEEE Trans Image Process 2009 18 11 2385-2401

Recommendations

Comments

Information & Contributors

Information

Published In

cover image Applied Intelligence
Applied Intelligence  Volume 53, Issue 8
Apr 2023
1255 pages

Publisher

Kluwer Academic Publishers

United States

Publication History

Published: 12 August 2022
Accepted: 28 May 2022

Author Tags

  1. Reverse k-nearest neighbor
  2. Poisson distribution
  3. Wavelet transform
  4. Dynamic local density
  5. Cluster order

Qualifiers

  • Research-article

Funding Sources

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 0
    Total Downloads
  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Oct 2024

Other Metrics

Citations

View Options

View options

Get Access

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media