Skip to main content
Log in

Multi-level clustering based on cluster order constructed with dynamic local density

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Density-based clustering has gained increasing attention during the past decades as it allows the discovery of clusters with arbitrary shapes and is robust to noisy objects. However, existing density-based clustering approaches tend to fail if there exist multiple clusters with different densities in a sea of noise. In this paper, we propose a new multi-level clustering method by exploiting the dynamic local density with wavelet transform. Specifically, a concept of dynamic reverse k-nearest neighbor is first introduced, and its count distribution is modeled as a Poisson distribution. The dynamic local density, which is robust to density varieties, is further defined with the cumulative Poisson distribution function. Afterward, a cluster order is constructed based on the derived dynamic local density and finally used to yield the clusters by employing the wavelet transform. Compared to existing approaches, our proposed method can detect clusters with different densities and allows obtaining more clustering information such as the number of clusters, break points between clusters, the boundary of clusters, etc. Extensive experiments on both synthetic and real-world data sets have demonstrated that our proposed method is effective and produces better clustering results when compared to many state-of-the-art clustering algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

References

  1. Saxena A, Prasad M, Gupta A, Bharill N, Patel O P, Tiwari A, Er M J, Ding W, Lin C-T (2017) A review of clustering techniques and developments. Neurocomputing 267:664–681

    Article  Google Scholar 

  2. Campello R J G B, Kroeger P, Sander J, Zimek A (2020) Density-based clustering. Wiley Interdisciplinary Reviews-Data Mining and Knowledge Discovery, 10

  3. Tang D, Zhang S, Chen J, Wang X (2021) The detection of low-rate dos attacks using the sadbscan algorithm. Inf Sci 565:229–247

    Article  MathSciNet  Google Scholar 

  4. Xie X, Duan L, Qiu T, Li J (2021) Quantum algorithm for mmng-based dbscan. Sci rep 11:15559

    Article  Google Scholar 

  5. Rad M H, Abdolrazzagh-Nezhad M (2020) A new hybridization of dbscan and fuzzy earthworm optimization algorithm for data cube clustering. Soft Comput 24:15529–15549

    Article  Google Scholar 

  6. Lv Y, Ma T, Tang M, Cao J, Tian Y, AL-Dhelaan A, Al-Rodhaan M (2016) An efficient and scalable density-based clustering algorithm for datasets with complex structures. Neurocomputing 171:9–22

    Article  Google Scholar 

  7. Bryant A, Cios K (2018) Rnn-dbscan: A density-based clustering algorithm using reverse nearest neighbor density estimates. IEEE Trans Knowl Data Eng 30:1109–1121

    Article  Google Scholar 

  8. Lu J, Zhu Q (2017) An effective algorithm based on density clustering framework. IEEE Access 5:4991–5000

    Article  Google Scholar 

  9. Cassisi C, Ferro A, Giugno R, Pigola G, Pulvirenti A (2013) Enhancing density-based clustering: Parameter reduction and outlier detection. Inf Syst 38(3):317–330

    Article  Google Scholar 

  10. Zhu Q, Feng J, Huang J (2016) Natural neighbor: A self-adaptive neighborhood method without parameter k. Pattern Recogn Lett 80:30–36

    Article  Google Scholar 

  11. Xie H, Li P (2021) A density-based evolutionary clustering algorithm for intelligent development. Eng Appl Artif Intell, 104

  12. Rodriguez A, Laio A (2014) Clustering by fast search and find of density peaks. Science 344:1492–1496

    Article  Google Scholar 

  13. Du M, Ding S, Jia H (2016) Study on density peaks clustering based on k-nearest neighbors and principal component analysis. Knowl-Based Syst 99:135–145

    Article  Google Scholar 

  14. Liu R, Wang H, Yu X (2018) Shared-nearest-neighbor-based clustering by fast search and find of density peaks. Inf Sci 450:200–226

    Article  MathSciNet  Google Scholar 

  15. Ni L, Luo W, Zhu W, Liu W (2019) Clustering by finding prominent peaks in density space. Eng Appl Artif Intell 85:727–739

    Article  Google Scholar 

  16. Lu J, Zhu Q, Wu Q (2018) A novel data clustering algorithm using heuristic rules based on k-nearest neighbors chain. Eng Appl Artif Intell 72:213–227

    Article  Google Scholar 

  17. Hou J, Zhang A, Qi N (2020) Density peak clustering based on relative density relationship. Pattern Recogn, 108

  18. Wang S, Li Q, Zhao C, Zhu X, Yuan H, Dai T (2021) Extreme clustering - a clustering method via density extreme points. Inf Sci 542:24–39

    Article  MathSciNet  MATH  Google Scholar 

  19. Vadapalli S, Valluri S R, Karlapalem K (2006) A simple yet effective data clustering algorithm. In: IEEE International conference on data mining, p 1108

  20. Ren Y, Wang N, Li M, Xu Z (2020) Deep density-based image clustering. Knowl-Based Syst 197:1–8

    Article  Google Scholar 

  21. Guo S, Xu J, Chen D, Zhang C, Wang X, Zhao R (2020) Density-aware feature embedding for face clustering. In: IEEE Conference on Computer Vision and Pattern Recognition, pp 6697– 6705

  22. Qi C, Zhang J, Jia H, Mao Q, Wang L, Song H (2021) Deep face clustering using residual graph convolutional network. Knowl-Based Syst 211:2854–2871

    Article  Google Scholar 

  23. Chen R, Tang Y, Tian L, Zhang C, Zhang W (2022) Deep convolutional self-paced clustering. Appl Intell 52(5):4858–4872

    Article  Google Scholar 

  24. Inouye D, Yang E, Allen G, Ravikumar P (2017) A review of multivariate distributions for count data derived from the poisson distribution. Wiley interdisciplinary reviews Computational statistics, 9

  25. Vagner A (2016) The gridoptics clustering algorithm. Intell Data Anal 20:1061–1084

    Article  Google Scholar 

  26. Agrawal K P, Garg S, Sharma S, Patel P (2016) Development and validation of optics based spatio-temporal clustering technique. Inf Sci 369:388–401

    Article  Google Scholar 

  27. Zhu X, Nie S, Wang C, Xi X, Wang J, Li D, Zhou H (2021) A noise removal algorithm based on optics for photon-counting lidar data. IEEE Geosci Remote Sens Lett 8:1471–1475

    Article  Google Scholar 

  28. Franti P, Sieranoja S (2018) K-means properties on six clustering benchmark datasets. Appl Intell 48(12):4743–4759

    Article  MATH  Google Scholar 

  29. Amin H U, Yusoff M Z, Ahmad R F (2020) A novel approach based on wavelet analysis and arithmetic coding for automated detection and diagnosis of epileptic seizure in eeg signals using machine learning techniques. Biomedical Signal Processing and Control, 56

  30. Dogan S, Tuncer T (2021) A novel statistical decimal pattern-based surface electromyogram signal classification method using tunableq-factor wavelet transform. Soft Comput 25:1085– 1098

    Article  Google Scholar 

  31. Suzuki T (2020) Wavelet-based spectral-spatial transforms for cfa-sampled raw camera image compression. IEEE Trans Image Process 29:433–444

    Article  MathSciNet  MATH  Google Scholar 

  32. Ma H, Liu D, Xiong R, Wu F (2020) iwave: Cnn-based wavelet-like transform for image compression. IEEE Trans Multimed 22(7):1667–1679

    Article  Google Scholar 

  33. Daubechies I (1990) The wavelet transform, time-frequency localization and signal analysis. IEEE Trans Inf Theory 36:961–1005

    Article  MathSciNet  MATH  Google Scholar 

  34. Shyh-Jier H, Cheng-Tao H (2002) Coiflet wavelet transform applied to inspect power system disturbance-generated signals. IEEE Trans Aerosp Electron Syst 38:1

    Google Scholar 

  35. Bergil E, Oral C, Ergul E U (2021) Efficient hand movement detection using k-means clustering and k-nearest neighbor algorithms. J Med Biol Eng 41(1):11–24

    Article  Google Scholar 

  36. Jafari M M, Ojaghlou H, Zare M, Schumann G J-P (2021) Application of a novel hybrid wavelet-anfis/fuzzy c-means clustering model to predict groundwater fluctuations. Atmosphere 12:1

    Google Scholar 

  37. Kulagin V P, Akimov D A, Pavelyev S A, Potapov D A (2019) Automated identification of critical malfunctions of aircraft engines based on modified wavelet transform and deep neural network clustering. In: 2019 Workshop on materials and engineering in aeronautics (MEA), vol 714

  38. Sheikholeslami G, Chatterjee S, Zhang AD (2000) Wavecluster: a wavelet-based clustering approach for spatial data in very large databases. VLDB J 8(3-4):289–304

    Article  Google Scholar 

  39. Xu Y, Zhu Q, Fan Z, Qiu M, Chen Y, Liu H (2013) Coarse to fine k nearest neighbor classifier. IEEE Trans Knowl Data Eng 34(9):980–986

    Google Scholar 

  40. Ester M, Kriegel HP, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of the second international conference on knowledge discovery and data mining (KDD-96), pp 226–231

  41. Ankerst M, Breunig M M, Kriegel H P, Plant C, Sander J (1999) Optics: ordering points to identify the clustering structure. In: SIGMOD 1999, Proceedings ACM SIGMOD international conference on management of data

  42. von Luxburg U (2007) A tutorial on spectral clustering. Stat Comput 17(4):395–416

    Article  MathSciNet  Google Scholar 

  43. Liu X, Cheng H-M, Zhang Z-Y (2020) Evaluation of community detection methods. IEEE Trans Knowl Data Eng 32(9):1736–1746

    Google Scholar 

  44. Zhang S, Wong H-S, Shen Y (2012) Generalized adjusted rand indices for cluster ensembles. Pattern Recogn 45(6):2214–2226

    Article  MATH  Google Scholar 

  45. Dua D, Graff C (2019) Uci machine learning repository [http://archive.ics.uci.edu/ml]. Irvine, CA: University of California, School of Information and Computer Science

  46. Sampat M P, Wang Z, Gupta S, Bovik A C, Markey M K (2009) Complex wavelet structural similarity: a new image similarity index. IEEE Trans Image Process 18(11):2385–2401

    Article  MathSciNet  MATH  Google Scholar 

Download references

Acknowledgements

This work is supported by the Fundamental Research Funds for the Central Universities (ZYGX2019Z014), National Natural Science Foundation of China (61976044, 52079026), Fok Ying-Tong Education Foundation (161062), Science and Technology Research Program of Chongqing Municipal Education Commission (KJQN202103109),and Sichuan Science and Technology Program (2020YFH0037).

We wish to confirm that there are no known conflicts of interest associated with this publication and there has been no significant financial support for this work that could have influenced its outcome.

We confirm that the manuscript has been read and approved by all named authors and that there are no other persons who satisfied the criteria for authorship but are not listed. We further confirm that the order of authors listed in the manuscript has been approved by all of us.

We understand that the Corresponding Author is the sole contact for the Editorial process (including Editorial Manager and direct communications with the office). He/she is responsible for communicating with the other authors about progress, submissions of revisions and final approval of proofs. We confirm that we have provided a current, correct email address which is accessible by the Corresponding Author.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shao Junming.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Jianyun, L., Junming, S. & Chunling, W. Multi-level clustering based on cluster order constructed with dynamic local density. Appl Intell 53, 9744–9761 (2023). https://doi.org/10.1007/s10489-022-03830-8

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-022-03830-8

Keywords

Navigation