skip to main content
10.1145/3577530.3577557acmotherconferencesArticle/Chapter ViewAbstractPublication PagescsaiConference Proceedingsconference-collections
research-article

A Density Peak Clustering algorithm based on Adaptive K-nearest Neighbors with Evidential Strategy

Published: 30 March 2023 Publication History

Abstract

A novel clustering method based on density peaks(DPC) was published on the journal Science. However, for DPC, it is a challenge to choose an appropriate cutoff distance dc, which can affect local density distribution of the datasets. Additionally, the assigning strategy in DPC may cause a domino effect: Many following points might be wrongly assigned once a point is assigned erroneously. To overcome these drawbacks, we introduce a density peak clustering method based on adaptive K-nearest neighbors with evidential assigning strategy (named as AKE-DPC). Firstly, we introduce an adaptive K-nearest neighbors strategy to find a proper K which can form a reasonable local density distribution and find cluster centers simultaneously; secondly, after the cluster centers are found, we introduce a strategy based on evidential rules to assign the remaining points. The advantages of AKE-DPC are obvious that negate the need for decision graph and input parameter such as p, a percentage which is used to computer the cutoff distance dc in DPC. Experiments on both synthetic and real-world datasets are conducted to compare our algorithm with both DPC and some DPC-KNN algorithms such as ADPC-KNN, FKNN-DPC and DPC-KNN. The clustering results demonstrate that our algorithm is not only effective but also outperforms other algorithms on most cases.

References

[1]
Liu J, Pham T D, Yan H, Fuzzy mixed-prototype clustering algorithm for microarray data analysis[J]. Neurocomputing, 2017, 276: 53-65.
[2]
R. Xu, I. Wunsch D., Survey of clustering algorithms[J]. IEEE Transaction on Neural Network, 2005,16(3) : 645–678 .
[3]
Jajoo G, Kumar Y, Yadav S K, Blind signal modulation recognition through clustering analysis of constellation signature[J]. Expert Systems with Applications, 2017, 90:13-22.
[4]
Pham N V, Pham L T, Nguyen T D, A new cluster tendency assessment method for fuzzy co-clustering in hyperspectral image analysis[J]. Neurocomputing, 2018,307:213-226.
[5]
Pagnuco, Inti A., Analysis of genetic association using hierarchical clustering and cluster validation indices. Genomics 2017,109:438-445.
[6]
Rodriguez A, Laio A. Machine learning. Clustering by fast search and find of density peaks.[J]. Science, 2014, 344(6191):1492–1496.
[7]
Hathaway R J, Hu Y . Density-Weighted Fuzzy c-Means Clustering[J]. Fuzzy Syst. IEEE Trans. 2009,17 (1) :243–252.
[8]
Franti P, Virmajoki O, Hautamaki V . Fast agglomerative clustering using a k-nearest neighbor graph.[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2006, 28(11):1875-81.
[9]
Geng Y A, Li Q, Zheng R, RECOME: a New Density-Based Clustering Algorithm Using Relative KNN Kernel Density[J]. Information Sciences, 2016, s 436–437:13-30.
[10]
Chen J, Schizas I D . Distributed information-based clustering of heterogeneous sensor data[J]. Signal Processing, 2016,126:35-51.
[11]
Xu J, Wang G, Deng W . DenPEHC: Density peak based efficient hierarchical clustering[J]. Information Sciences, 2016, 373(12):200-218.
[12]
Su Z, Denoeux T, BPEC: Belief-Peaks Evidential Clustering, IEEE Transactions on Fuzzy Systems, 2019, 27(1): 111-123.
[13]
Jain A K. Data Clustering: 50 Years Beyond K-means[M]// Machine Learning and Knowledge Discovery in Databases. 2008.
[14]
Likas A, Vlassis N, Verbeek J J . The global k-means clustering algorithm[J]. Pattern Recognition: The Journal of the Pattern Recognition Society, 2003, 36(2):451-461.
[15]
Kanungo T, Mount D M, Netanyahu N S, An efficient k-means clustering algorithm: analysis and implementation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(7):0-892.
[16]
Arthur D . k-means++ : the advantages of careful seeding[C]// Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms, 2007. Society for Industrial and Applied Mathematics, 2007.
[17]
Ester M, Kriegel H P, Xu X. A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise[C]// International Conference on Knowledge Discovery & Data Mining. 1996.
[18]
Xie J, Gao H, Xie W, Robust clustering by detecting density peaks and assigning points based on fuzzy weighted K-nearest neighbors[J]. Information Sciences, 2016,354:19-40.
[19]
Du M, Ding S, Jia H . Study on Density Peaks Clustering Based on k-Nearest Neighbors and Principal Component Analysis[J]. Knowledge-Based Systems, 2016,99:135-145.
[20]
Yaohui L, Zhengming M, Fang Y. Adaptive density peak clustering based on K-nearest neighbors with aggregating strategy[J]. Knowledge-Based Systems, 2017,133:208-220.
[21]
Denoeux T, Marie-Hélène Masson. EVCLUS: Evidential clustering of proximity data[J]. IEEE TRANSACTIONS ON CYBERNETICS, 2004, 34(1):95-109.
[22]
Liu Z G, Pan Q, Dezert J . Evidential classifier for imprecise data based on belief functions[J]. Knowledge-Based Systems, 2013, 52:246-257.
[23]
Liu Z G, Dezert J, Grégoire Mercier, Belief C-Means: An extension of Fuzzy C-Means algorithm in belief functions framework[J]. Pattern Recognition Letters, 2012, 33(3):291-300.
[24]
Liu Z G, Pan Q, Dezert J, Credal c-means clustering method based on belief functions[J]. Knowledge-Based Systems, 2015, 74(1):119-132.
[25]
Shafer, G,: A Mathematical Theory of Evidence[M]. Princeton University Press, Princeton. 1976.
[26]
Smets P . Decision making in the TBM: The necessity of the pignistic transformation[M]. Elsevier Science Inc. 2005.
[27]
Marie-Helene Masson, T. Denoeux. ECM: An evidential version of the fuzzy c-means algorithm[J]. Pattern Recognition, 2008, 41(4):1384-1397.
[28]
Karkkainen I, Franti P . Dynamic local search for clustering with unknown number of clusters[C]// Pattern Recognition, 2002. Proceedings. 16th International Conference on. IEEE, 2002.
[29]
Veenman C J, Reinders M J T, Backer E . A maximum variance cluster algorithm[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002, 24(9):0-1280.
[30]
Franti P, Virmajoki O, Hautamaki V . Fast agglomerative clustering using a k-nearest neighbor graph.[J]. IEEE Transactions on Pattern Analysis & Machine Intelligence, 2006, 28(11):1875-81.
[31]
Franti P, Virmajoki O . Iterative shrinking method for clustering problems[J]. Pattern Recognition, 2006, 39(5):761-775.
[32]
Bache K, Lichman M . “UCI machine learning repository”, 2013,http://archive. ics. uci. edu/ml.
[33]
Gong C, Su Z, Wang P, Cumulative belief peaks evidential K-nearest neighbor clustering[J]. Knowledge-Based Systems, 2020, 200: 105982.
[34]
Gong C, Su Z, Wang P, An evidential clustering algorithm by finding belief-peaks and disjoint neighborhoods[J]. Pattern Recognition, 2021, 113: 107751.
[35]
Gong C, Su Z, Wang P, Distributed evidential clustering toward time series with big data issue[J]. Expert Systems with Applications, 2022, 191: 116279.
[36]
Gong C, Li Y, Fu D, Self-reconstructive evidential clustering for high-dimensional data[C]//2022 IEEE 38th International Conference on Data Engineering (ICDE). IEEE, 2022: 2099-2112.

Index Terms

  1. A Density Peak Clustering algorithm based on Adaptive K-nearest Neighbors with Evidential Strategy

    Recommendations

    Comments

    Information & Contributors

    Information

    Published In

    cover image ACM Other conferences
    CSAI '22: Proceedings of the 2022 6th International Conference on Computer Science and Artificial Intelligence
    December 2022
    341 pages
    ISBN:9781450397773
    DOI:10.1145/3577530
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 30 March 2023

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. Clustering method
    2. adaptive K-nearest neighbors
    3. evidential rules

    Qualifiers

    • Research-article
    • Research
    • Refereed limited

    Conference

    CSAI 2022

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • 0
      Total Citations
    • 40
      Total Downloads
    • Downloads (Last 12 months)16
    • Downloads (Last 6 weeks)1
    Reflects downloads up to 25 Feb 2025

    Other Metrics

    Citations

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    HTML Format

    View this article in HTML Format.

    HTML Format

    Figures

    Tables

    Media

    Share

    Share

    Share this Publication link

    Share on social media