Skip to main content

A Potential-Based Density Estimation Method for Clustering Using Decision Graph

  • Conference paper
  • First Online:
Book cover Intelligent Data Engineering and Automated Learning – IDEAL 2017 (IDEAL 2017)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 10585))

Abstract

Clustering is an important unsupervised machine learning method which has played an important role in various fields. As suggested by Alex Rodriguez et al. in a paper published in Science in 2014, the 2D decision graph of the estimated density value versus the minimum distance from the points with higher density values for all the data points can be used to identify the cluster centroids. However, the traditional kernel density estimation methods may be affected by the setting of the parameters and cannot work well for some complex datasets. In this work, a novel potential-based method is designed to estimate density values, which is not sensitive to the parameters and is more effective than the traditional kernel density estimation methods. Experiments on several synthetic and real-world datasets show the superiority of the proposed method in clustering the datasets with various distributions and dimensionalities.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Satyanarayana, A., Acquaviva, V.: Enhanced cobweb clustering for identifying analog galaxies in astrophysics. In: IEEE Electrical and Computer Engineering, pp. 1–4 (2014)

    Google Scholar 

  2. Menéndez, H.D., Plaza, L., Camacho, D.: Combining graph connectivity and genetic clustering to improve biomedical summarization. In: IEEE Congress on Evolutionary Computation, pp. 2740–2747 (2014)

    Google Scholar 

  3. Han, E., et al.: Clustering of 770,000 genomes reveals post-colonial population structure of North America. Nat. Commun. 8, 14238 (2017)

    Article  Google Scholar 

  4. Lu, Y., Wan, Y.: Clustering by sorting potential values (CSPV): a novel potential-based clustering method. Pattern Recogn. 45(9), 3512–3522 (2012)

    Article  Google Scholar 

  5. Kleinberg, J.: An impossibility theorem for clustering. In: NIPS, pp. 463–470 (2002)

    Google Scholar 

  6. Gan, G., Ng, M.K.-P.: k-means clustering with outlier removal. Pattern Recogn. Lett. 8–14 (2017)

    Google Scholar 

  7. Song, H., Yan, H.: Novel K-medoids clustering algorithm based on dynamic search of microparticles under optimized granular computing. Intell. Comput. Appl. (2016)

    Google Scholar 

  8. Frey, B.J., Dueck, D.: Clustering by passing messages between data points. Science 315(5814), 972–976 (2007)

    Article  MathSciNet  MATH  Google Scholar 

  9. Serdah, A.M., Ashour, W.M.: Clustering large-scale data based on modified affinity propagation algorithm. J. Artif. Intell. Soft Comput. Res. 6(1), 23–33 (2016)

    Article  Google Scholar 

  10. Ward Jr., J.H.: Hierarchical grouping to optimize an objective function. J. Am. Stat. Assoc. 58(301), 236–244 (1963)

    Article  MathSciNet  Google Scholar 

  11. Höppner, F.: Fuzzy Cluster Analysis: Methods for Classification, Data Analysis and Image Recognition. Wiley, New York (1999)

    MATH  Google Scholar 

  12. Jain, A.K.: Data clustering: 50 years beyond K-means. Pattern Recogn. Lett. 31(8), 651–666 (2010)

    Article  Google Scholar 

  13. Ester, M., et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD, vol. 96(34), pp. 226–231 (1996)

    Google Scholar 

  14. Ghassabeh, Y.A., Rudzicz, F.: The mean shift algorithm and its relation to Kernel regression. Inf. Sci. 348, 198–208 (2016)

    Article  MathSciNet  Google Scholar 

  15. Campello, R.J.G.B., et al.: Hierarchical density estimates for data clustering, visualization, and outlier detection. ACM Trans. Knowl. Discov. Data (TKDD) 10(1), 5 (2015)

    Google Scholar 

  16. Rodriguez, A., Laio, A.: Clustering by fast search and find of density peaks. Science 344(6191), 1492–1496 (2014)

    Article  Google Scholar 

  17. Lu, Y., Hou, X., Chen, X.: A novel travel-time based similarity measure for hierarchical clustering. Neurocomputing 173, 3–8 (2016)

    Article  Google Scholar 

  18. Parzen, E.: On estimation of a probability density function and mode. Ann. Math. Stat. 33(3), 1065–1076 (1962)

    Article  MathSciNet  MATH  Google Scholar 

  19. Fowlkes, E.B., Mallows, C.L.: A method for comparing two hierarchical clusterings. J. Am. Stat. Assoc. 78(383), 553–569 (1983)

    Article  MATH  Google Scholar 

Download references

Acknowledgments

This work is supported by the National Natural Science Foundation of China (Grants No. 61272213) and the Fundamental Research Funds for the Central Universities (Grants No. lzujbky-2016-k07).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yonggang Lu .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Yan, H., Lu, Y., Li, L. (2017). A Potential-Based Density Estimation Method for Clustering Using Decision Graph. In: Yin, H., et al. Intelligent Data Engineering and Automated Learning – IDEAL 2017. IDEAL 2017. Lecture Notes in Computer Science(), vol 10585. Springer, Cham. https://doi.org/10.1007/978-3-319-68935-7_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-68935-7_9

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-68934-0

  • Online ISBN: 978-3-319-68935-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics