Abstract
Density peaks clustering (DPC) algorithm provides an efficient method to quickly find cluster centers with decision graphs. In recent years, due to its unique parameters, no iteration, and good robustness, it has been widely studied and applied. However, it also has some shortcomings, such as no adaptability, inadaptability to high-dimensional data and accuracy is easily affected. For reducing the higher time complexity of DPC, we introduce the polar coordinates to DPC (PC-DPC). Firstly, obtain the distance from every point to third-party point and the cosine value of the angle formed with the third-party vector, and reorder the points by distances and cosine values. Then, select other points in the adjacent sequence number of each point to calculate distances, and build a sparse distance matrix. Finally, the sparse distance matrix is used as the input of DPC to obtain clustering results. Theoretical analysis and experiments show that, compared with DPC and other algorithms, PC-DPC greatly reduces running time of DPC while maintaining clustering precision.








Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Hilbert M, Lopez P (2011) The World’s technological capacity to store, communicate, and compute information. Since 332(6025):60–65
Chen XQ, Zhou LJ, Liu YZ (2017) A survey of clustering algorithms. Integ Technol 6(003):41–49
MacQueen JB (1967) Some methods for classification and analysis of multivariate observations. Proceed Fifth Berkeley Symposium Math Stat Probability:281–297
Hae-Sang, Park, Chi-Hyuck, et al. A simple and fast algorithm for k-medoids clustering - sciencedirect. Exp Syst Appl, 2009, 36(Part 2): 3336–3341
Karypis G, Han EH, Kumar V (1999) Chameleon: hierarchical clustering using dynamic modeling. Computer 32(8):68–75
Zhang T, Ramakrishnan R, Livny M (1996) BIRCH: an efficient data clustering method for very large databases. ACM SIGMOD Rec 25(2):103–114
Ester M, Kriegel HP, Sander J et al (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. Proceed 2nd Int Conf Knowledge Discovery Data Mining:226–231
Rodriguez A, Laio A (2014) Clustering by fast search and find of density peaks. Science 344(6191):1492–1496
Wang W, Yang J, Muntz R (1997) STING: A statistical information grid approach to spatial data mining. Proceed 23rd Int Conf Very Large Data Bases:186–195
Agrawal R, Gehrke J, Gunopulos D et al (1998) Automatic subspace clustering of high dimensional data for data mining applications. Proceed 1998 ACM SIGMOD Int Conf Manag Data:94–105
Xing CZ, Zhao QY, Wang X et al (2017) Research on accelerated EM algorithm based on robust Gaussian mixture model. Comput Appl Res 04:1042–1046
Sun YF, Lu YS (2007) A grid based subspace clustering algorithm for high dimensional data streams. Comput Sci (04):199–203+221
et al (2018) A survey of clustering with deep learning: from the perspective of network architecture. IEEE Access 6:39501–39514Min E, Guo X, Liu Qet al A survey of clustering with deep learning: from the perspective of network architecture. IEEE Access, 2018, 6:39501–39514
Luxburg UV (2007) A tutorial on spectral clustering. Stat Comput 17(4):395–416
Zhao J, Tang J, Fan T, Li C, Xu L (2020) Density peaks clustering based on circular partition and grid similarity. Concurr Comput Pract Exp 32(7):e5567
Yu D, Liu G, Guo M, Liu X, Yao S (2019) Density peaks clustering based on weighted local density sequence and nearest neighbor assignment. IEEE Access 7:34301–34317
Du M, Ding S, Jia H (2016) Study on density peaks clustering based on k-nearest neighbors and principal component analysis. Knowl-Based Syst 99:135–145
Gong S F, Zhang Y F. EDDPC: an efficient distributed density center clustering algorithm. Comput Res Dev 2016, (06): 1400–1409
Dean J, Ghemawat S (2008) MapReduce: simplified data processing on large clusters. Commun ACM 51(1):107–113
Ji X, Yao S, Zhao P (2020) Density peak clustering algorithm optimized by relative neighborhood and pruning strategy. J Auto 46(3):562–575
Xu X, Ding S, Du M et al (2018) DPCG: an efficient density peaks clustering algorithm based on grid. Int J Mach Learn Cybern 9(5):743–754
Xu X, Ding S, Sun T (2018) A fast density peaks clustering algorithm based on pre-screening. 2018 IEEE Int Conf Big Data and Smart Comput (BigComp):513–516
Chen Y, Hu X, Fan W, Shen L, Zhang Z, Liu X, du J, Li H, Chen Y, Li H (2020) Fast density peak clustering for large scale data based on kNN. Knowl-Based Syst 187:104824
Xu X, Ding S, Wang Y, Wang L, Jia W (2021) A fast density peaks clustering algorithm with sparse search. Inf Sci 554:61–83
Xu X, Ding S, Shi Z (2018) An improved density peaks clustering algorithm with fast finding cluster centers. Knowl-Based Syst 158:65–74
Vinh NX, Epps J, Bailey J (2010) Information theoretic measures for clusterings comparison: variants, properties, normalization and correction for chance. J Mach Learn Res 11:2837–2854
Yang Y, Jin F, Kamel M (2008) A survey of clustering validity evaluation. Comput Appl Res (06, 1630):–1632+1638
Xie J, Gao H, Xie W, Liu X, Grant PW (2016) Robust clustering by detecting density peaks and assigning points based on fuzzy weighted K-nearest neighbors. Inform Sci An Int J 354:19–40
Gionis A, Mannila H, Tsaparas P (2007) Clustering aggregation. ACM Trans Knowl Discov Data 1(1):4
Tu B, Zhang X, Kang X, Zhang G, Li S (2019) Density peak-based Noisy label detection for hyperspectral image classification. IEEE Trans Geosci Remote Sens 57(3):1573–1584
Zeng X, Chen A, Zhou M (2019) Color perception algorithm of medical images using density peak based hierarchical clustering. Biomed Signal Process Control 48:69–79
Yan H, Wang L, Lu Y (2019) Identifying cluster centroids from decision graph automatically using a statistical outlier detection method. Neurocomputing 329:348–358
Xu X, Ding S, Wang Y (2020) Optimized density peaks clustering algorithm based on dissimilarity measure. J Software 31(12):1–13
Xu X, Ding S, Sun T et al (2018) Large-scale density peaks clustering algorithm based on grid screening. J Comput Res Dev 55(11):2419–2429
Chen JY, He HH (2016) A fast density-based data stream clustering algorithm with cluster centers self-determined for mixed data. Inf Sci 345:271–293
Wang M, Min F, Zhang ZH, Wu YX (2017) Active learning through density clustering. Expert Syst Appl 85:305–317
Abdul MM, Huang JZ, Wei C et al (2018) I-nice: a new approach for identifying the number of clusters and initial cluster Centres. Inform Ences 466:129–151
Du J, Ma Y, Huang H (2021) Clustering algorithm based on local gravity and distance. Comput Appl:1–9
Acknowledgements
This work is supported by the National Natural Science Foundation of China under Grant no.61976216 and no.61672522.
Author information
Authors and Affiliations
Corresponding authors
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Li, C., Ding, S., Xu, X. et al. Fast density peaks clustering algorithm in polar coordinate system. Appl Intell 52, 14478–14490 (2022). https://doi.org/10.1007/s10489-022-03360-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-03360-3