Skip to main content
Log in

Chameleon algorithm based on improved natural neighbor graph generating sub-clusters

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Chameleon algorithm is a hierarchical clustering based on dynamic modeling. It can find high-quality clusters with different shapes, sizes and densities. However, Chameleon algorithm requires user-specifiedkwhen constructing sparse graph, which directly influences the clustering performance. In addition, the graph-partitioning technology used in the original algorithm, hMetis algorithm, is hard to build operation environment, and the number of partitions needs to be specified. These problems are arduous to determine without prior knowledge. In order to overcome the first problem, this paper introduces an improved natural neighbor method to construct a sparse graph, which can reflect the initial sparseness of the data. To address the second problems, this paper proposes a new method of generating sub-clusters in sparse graphs, which is simple and objective. In summary, this paper proposes Chameleon Algorithm Based on Improved Natural Neighbor Graph Generating Sub-clusters (INNGS-Chameleon). This algorithm is tested on 8 synthetic data sets and 10 UCI data sets. The results are compared with the Chameleon algorithm, its improved algorithm and several classic algorithms. The experimental results show that the INNGS-Chameleon algorithm is feasible and effective.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12

Similar content being viewed by others

References

  1. Xu X, Ding S, Shi Z (2018) An improved density peaks clustering algorithm with fast finding cluster centers. Knowl-Based Syst 158:65–74

    Article  Google Scholar 

  2. Wang H, Yang Y, Liu B, Fujita H (2019) A study of graph-based system for multi-view clustering. Knowl-Based Syst 163:1009–1019

    Article  Google Scholar 

  3. Zanin M, Papo D, Sousa PA et al (2016) Combining complex networks and data mining: why and how. Phys Rep-Rev Sect Phys Lett 635:1–44

    MathSciNet  Google Scholar 

  4. Fahy C, Yang S, Gongora M (2019) Ant Colony stream clustering: a fast density clustering algorithm for dynamic data streams. IEEE Trans Cybern 49(6):2215–2228

    Article  Google Scholar 

  5. Mojarad M, Nejatian S, Parvin H, Mohammadpoor M (2019) A fuzzy clustering ensemble based on cluster clustering and iterative fusion of base clusters. Appl Intell 49(7):2567–2581

    Article  Google Scholar 

  6. Lai T, Chen R, Yang C, Li Q, Fujita H, Sadri A, Wang H (2020) Efficient robust model fitting for multistructure data using global greedy search. IEEE Trans Cybern 50(7):3294–3306

    Article  Google Scholar 

  7. Weber LM, Robinson MD (2016) Comparison of clustering methods for high-dimensional single-cell flow and mass cytometry data. Cytom Part A 89A(12):1084–1096

    Article  Google Scholar 

  8. Chen SM, Cheng SH, Lan TC (2016) A novel similarity measure between intuitionistic fuzzy sets based on the centroid points of transformed fuzzy numbers with applications to pattern recognition. Inf Sci 343:15–40

    Article  MathSciNet  Google Scholar 

  9. Zhong YF, Ma AL, Ong YS et al (2018) Computational intelligence in optical remote sensing image processing. Appl Soft Comput 64:75–93

    Article  Google Scholar 

  10. Thomas MC, Zhu W, Romagnoli JA (2017) Data mining and clustering in chemical process databases for monitoring and knowledge discovery. J Process Control 67:160–175

    Article  Google Scholar 

  11. Saxena A, Prasad M, Gupta A, Bharill N, Patel OP, Tiwari A, Er MJ, Ding W, Lin CT (2017) A review of clustering techniques and developments. Neurocomputing 267:664–681

    Article  Google Scholar 

  12. Cai Q, Liu J (2019) Hierarchical clustering of bipartite networks based on multiobjective optimization. IEEE Trans Netw Sci Eng 7(1):421–434

    Article  MathSciNet  Google Scholar 

  13. Ros F, Guillaume S (2019) A hierarchical clustering algorithm and an improvement of the single linkage criterion to deal with noise. Expert Syst Appl 128:96–108

    Article  Google Scholar 

  14. Jafarzadegan M, Safi-Esfahani F, Beheshti Z (2019) Combining hierarchical clustering approaches using the PCA method. Expert Syst Appl 137:1–10

    Article  Google Scholar 

  15. Xie WB, Lee YL, Wang C, Chen DB, Zhou T (2020) Hierarchical clustering supported by reciprocal nearest neighbors. Inf Sci 527:279–292

    Article  MathSciNet  Google Scholar 

  16. Zhang T, Ramakrishnan R, Livny M (1996) BIRCH: an efficient data clustering method for very large. ACM SIGMOD Rec 25(2):103–114

    Article  Google Scholar 

  17. Guha S, Rastogi R, Shim K (1998) CURE: an efficient clustering algorithm for large databases. Inf Syst 26(1):35–58

    Article  Google Scholar 

  18. Guha S, Rastogi R, Shim K (2002) ROCK: a robust clustering algorithm for categorical attributes. Inf Syst 25(5):345–366

    Article  Google Scholar 

  19. Karypis G, Han EH, Kumar V (1999) Chameleon: hierarchical clustering using dynamic modeling. Computer 32(8):68–75

    Article  Google Scholar 

  20. Karypis G, Aggarwal R, Kumar V, Shekhar S (1999) Multilevel hypergraph partitioning: applications in VLSI domain. IEEE Trans Very Large Scale Integr Syst 7(1):69–79

    Article  Google Scholar 

  21. Zhang Y, Ding S, Wang L, Wang Y, Ding L (2020) Chameleon algorithm based on mutual k-nearest neighbors. Appl Intell. https://doi.org/10.1007/s10489-020-01926-7

  22. Zhang W, Li J (2015) Extended fast search clustering algorithm: widely density clusters, no density peaks. Comput Sci Inf Technol 5(7):1–17

    Google Scholar 

  23. Barton T, Bruna T, Kordik P (2019) Chameleon 2: an improved graph-based clustering algorithm. ACM Trans Knowl Discov Data 13(1):1–27

    Article  Google Scholar 

  24. Guo D, Zhao J, Liu J (2019) Research and Application of Improved CHAMELEON Algorithm Based on Condensed Hierarchical Clustering Method. In: Proceedings of the 2019 8th international conference on networks. Communication and Computing. Association for Computing Machinery, Luoyang, pp 14–18

    Google Scholar 

  25. Du M, Ding S, Jia H (2016) Study on density peaks clustering based on k-nearest neighbors and principal component analysis. Knowl-Based Syst 99:135–145

    Article  Google Scholar 

  26. Zhu Q, Feng J, Huang J (2016) Natural neighbor: a self-adaptive neighborhood method without parameter K. Pattern Recogn Lett 80:30–36

    Article  Google Scholar 

  27. Agarwal M, Jaiswal R, Pal A (2015) K-means++ under approximation stability. Theor Comput Sci 588:37–51

    Article  MathSciNet  Google Scholar 

  28. Xu X, Ding S, Xu H et al (2018) A feasible density peaks clustering algorithm with a merging strategy. Soft Comput 23(13):5171–5183

    Article  Google Scholar 

  29. Zhang S, Wong HS, Shen Y (2012) Generalized adjusted Rand indices for cluster ensembles. Pattern Recogn 45(6):2214–2226

    Article  Google Scholar 

  30. Vinh NX, Epps J, Bailey J (2010) Information theoretic measures for clusterings comparison: variants, properties, normalization and correction for chance. J Mach Learn Res 11:2837–2854

    MathSciNet  MATH  Google Scholar 

  31. Macqueen J B (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, Volume 1: Statistics, University of California Press, 5.1:281–297

  32. Ester M, Kriegel H-P, Sander J, Xu X (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD'96: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining, 226–231

  33. Schubert E, Sander J, Ester M, Kriegel HP, Xu X (2017) DBSCAN revisited, revisited: why and how you should (still) use DBSCAN. ACM Trans Database Syst 42(3):1–21

    Article  MathSciNet  Google Scholar 

Download references

Acknowledgements

This work is supported by the National Natural Science Foundations of China (no.61976216 and no.61672522).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Shifei Ding.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, Y., Ding, S., Wang, Y. et al. Chameleon algorithm based on improved natural neighbor graph generating sub-clusters. Appl Intell 51, 8399–8415 (2021). https://doi.org/10.1007/s10489-021-02389-0

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-021-02389-0

Keywords

Navigation