Skip to main content

A Hybrid Data Clustering Approach Based on Hydrologic Cycle Optimization and K-means

  • Conference paper
  • First Online:
Bio-inspired Computing: Theories and Applications (BIC-TA 2018)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 952))

Abstract

K-means is a popular and simple clustering method by grouping data into predefined K clusters efficiently. However, K-means performs poorly in the presence of poor centers and tends to converge prematurely. Hydrologic Cycle Optimization, as a novel algorithm inspired by the natural phenomena, has a good ability to search for the global optimal solutions. To overcome drawbacks associated with the K-means and find better initial centroids, in this study, a hybrid clustering algorithm based on Hydrologic Cycle Optimization and K-means (abbreviated as HCO+K-means) is proposed. The proposed algorithm includes two modules: HCO module and K-means module. It executes HCO module firstly to find the best individual with optimal fitness value. While the position of the best individual is then considered as initial set of centers for K-means module to search for a higher quality clustering solution. For comparison purpose, the K-means, PSO+K-means, WCA+K-means and HCO+K-means algorithms are chosen to evaluate on six different datasets. The experimental results indicate that the proposed HCO+K-means algorithm has a strong global search ability and obtains better clustering results in comparison to the other clustering methods.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Abraham, A., Das, S., Roy, S.: Swarm intelligence algorithms for data clustering. In: Maimon, O., Rokach, L. (eds.) Soft Computing for Knowledge Discovery and Data Mining, pp. 279–313. Springer, Boston (2008). https://doi.org/10.1007/978-0-387-69935-6_12

    Chapter  Google Scholar 

  2. Ci, S., Guizani, M., Sharif, H.: Adaptive clustering in wireless sensor networks by mining sensor energy data. Comput. Commun. 30(14), 2968–2975 (2007)

    Article  Google Scholar 

  3. Portela, N.M., Cavalcanti, G.D.C., Ren, T.I.: Semi-supervised clustering for MR brain image segmentation. Expert Syst. Appl. 41(4), 1492–1497 (2014)

    Article  Google Scholar 

  4. Kuo, R.J., Wang, M.J., Huang, T.W.: An application of particle swarm optimization algorithm to clustering analysis. Soft. Comput. 15(3), 533–542 (2011)

    Article  Google Scholar 

  5. Pollard, D.: A central limit theorem for k-means clustering. Ann. Probab. 10(4), 919–926 (1982)

    Article  MathSciNet  Google Scholar 

  6. Cao, D.N., Cios, K.J.: GAKREM: a novel hybrid clustering algorithm. Inf. Sci. 178(22), 4205–4227 (2008)

    Article  Google Scholar 

  7. Laszlo, M., Mukherjee, S.: A genetic algorithm that exchanges neighboring centers for k-means clustering. Pattern Recogn. Lett. 28(16), 2359–2366 (2007)

    Article  Google Scholar 

  8. Li, H., He, H., Wen, Y.: Dynamic particle swarm optimization and k-means clustering algorithm for image segmentation. Opt.-Int. J. Light. Electron Opt. 126(24), 4817–4822 (2015)

    Article  Google Scholar 

  9. Niknam, T., Amiri, B.: An efficient hybrid approach based on PSO, ACO and k-means for cluster analysis. Appl. Soft Comput. 10(1), 183–197 (2010)

    Article  Google Scholar 

  10. Kwedlo, W.: A clustering method combining differential evolution with the k-means algorithm. Pattern Recogn. Lett. 32(12), 1613–1621 (2011)

    Article  Google Scholar 

  11. Yan, X., Niu, B.: Hydrologic cycle optimization part I: background and theory. In: Tan, Y., Shi, Y., Tang, Q. (eds.) ICSI 2018. LNCS, vol. 10941, pp. 341–349. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93815-8_33

    Chapter  Google Scholar 

  12. Niu, B., Liu, H., Yan, X.: Hydrologic cycle optimization part II: experiments and real-world application. In: Tan, Y., Shi, Y., Tang, Q. (eds.) ICSI 2018. LNCS, vol. 10941, pp. 350–358. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93815-8_34

    Chapter  Google Scholar 

  13. Jain, Anil K.: Data clustering: 50 years beyond K-means. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008. LNCS (LNAI), vol. 5211, pp. 3–4. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-87479-9_3

    Chapter  Google Scholar 

  14. Kennedy, J.: Particle swarm optimization. In: Sammut, C., Webb, G.I. (eds.) Encyclopedia of Machine Learning, pp. 760–766. Springer, Boston (2010)

    Google Scholar 

  15. Eskandar, H., Sadollah, A., Bahreininejad, A., Hamdi, M.: Water cycle algorithm–a novel metaheuristic optimization method for solving constrained engineering optimization problems. Comput. Struct. 110–111(10), 151–166 (2012)

    Article  Google Scholar 

Download references

Acknowledgment

This work is partially supported by The National Natural Science Foundation of China (Grants Nos. 71571120, 71271140, 71771154, 61603310, 71471158, 71001072, 61472257), and Research Foundation of Shenzhen University (85303/00000155).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hong Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Niu, B., Liu, H., Liu, L., Wang, H. (2018). A Hybrid Data Clustering Approach Based on Hydrologic Cycle Optimization and K-means. In: Qiao, J., et al. Bio-inspired Computing: Theories and Applications. BIC-TA 2018. Communications in Computer and Information Science, vol 952. Springer, Singapore. https://doi.org/10.1007/978-981-13-2829-9_30

Download citation

  • DOI: https://doi.org/10.1007/978-981-13-2829-9_30

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-13-2828-2

  • Online ISBN: 978-981-13-2829-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics