Abstract
K-means is a popular and simple clustering method by grouping data into predefined K clusters efficiently. However, K-means performs poorly in the presence of poor centers and tends to converge prematurely. Hydrologic Cycle Optimization, as a novel algorithm inspired by the natural phenomena, has a good ability to search for the global optimal solutions. To overcome drawbacks associated with the K-means and find better initial centroids, in this study, a hybrid clustering algorithm based on Hydrologic Cycle Optimization and K-means (abbreviated as HCO+K-means) is proposed. The proposed algorithm includes two modules: HCO module and K-means module. It executes HCO module firstly to find the best individual with optimal fitness value. While the position of the best individual is then considered as initial set of centers for K-means module to search for a higher quality clustering solution. For comparison purpose, the K-means, PSO+K-means, WCA+K-means and HCO+K-means algorithms are chosen to evaluate on six different datasets. The experimental results indicate that the proposed HCO+K-means algorithm has a strong global search ability and obtains better clustering results in comparison to the other clustering methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Abraham, A., Das, S., Roy, S.: Swarm intelligence algorithms for data clustering. In: Maimon, O., Rokach, L. (eds.) Soft Computing for Knowledge Discovery and Data Mining, pp. 279–313. Springer, Boston (2008). https://doi.org/10.1007/978-0-387-69935-6_12
Ci, S., Guizani, M., Sharif, H.: Adaptive clustering in wireless sensor networks by mining sensor energy data. Comput. Commun. 30(14), 2968–2975 (2007)
Portela, N.M., Cavalcanti, G.D.C., Ren, T.I.: Semi-supervised clustering for MR brain image segmentation. Expert Syst. Appl. 41(4), 1492–1497 (2014)
Kuo, R.J., Wang, M.J., Huang, T.W.: An application of particle swarm optimization algorithm to clustering analysis. Soft. Comput. 15(3), 533–542 (2011)
Pollard, D.: A central limit theorem for k-means clustering. Ann. Probab. 10(4), 919–926 (1982)
Cao, D.N., Cios, K.J.: GAKREM: a novel hybrid clustering algorithm. Inf. Sci. 178(22), 4205–4227 (2008)
Laszlo, M., Mukherjee, S.: A genetic algorithm that exchanges neighboring centers for k-means clustering. Pattern Recogn. Lett. 28(16), 2359–2366 (2007)
Li, H., He, H., Wen, Y.: Dynamic particle swarm optimization and k-means clustering algorithm for image segmentation. Opt.-Int. J. Light. Electron Opt. 126(24), 4817–4822 (2015)
Niknam, T., Amiri, B.: An efficient hybrid approach based on PSO, ACO and k-means for cluster analysis. Appl. Soft Comput. 10(1), 183–197 (2010)
Kwedlo, W.: A clustering method combining differential evolution with the k-means algorithm. Pattern Recogn. Lett. 32(12), 1613–1621 (2011)
Yan, X., Niu, B.: Hydrologic cycle optimization part I: background and theory. In: Tan, Y., Shi, Y., Tang, Q. (eds.) ICSI 2018. LNCS, vol. 10941, pp. 341–349. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93815-8_33
Niu, B., Liu, H., Yan, X.: Hydrologic cycle optimization part II: experiments and real-world application. In: Tan, Y., Shi, Y., Tang, Q. (eds.) ICSI 2018. LNCS, vol. 10941, pp. 350–358. Springer, Cham (2018). https://doi.org/10.1007/978-3-319-93815-8_34
Jain, Anil K.: Data clustering: 50 years beyond K-means. In: Daelemans, W., Goethals, B., Morik, K. (eds.) ECML PKDD 2008. LNCS (LNAI), vol. 5211, pp. 3–4. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-87479-9_3
Kennedy, J.: Particle swarm optimization. In: Sammut, C., Webb, G.I. (eds.) Encyclopedia of Machine Learning, pp. 760–766. Springer, Boston (2010)
Eskandar, H., Sadollah, A., Bahreininejad, A., Hamdi, M.: Water cycle algorithm–a novel metaheuristic optimization method for solving constrained engineering optimization problems. Comput. Struct. 110–111(10), 151–166 (2012)
Acknowledgment
This work is partially supported by The National Natural Science Foundation of China (Grants Nos. 71571120, 71271140, 71771154, 61603310, 71471158, 71001072, 61472257), and Research Foundation of Shenzhen University (85303/00000155).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Niu, B., Liu, H., Liu, L., Wang, H. (2018). A Hybrid Data Clustering Approach Based on Hydrologic Cycle Optimization and K-means. In: Qiao, J., et al. Bio-inspired Computing: Theories and Applications. BIC-TA 2018. Communications in Computer and Information Science, vol 952. Springer, Singapore. https://doi.org/10.1007/978-981-13-2829-9_30
Download citation
DOI: https://doi.org/10.1007/978-981-13-2829-9_30
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-2828-2
Online ISBN: 978-981-13-2829-9
eBook Packages: Computer ScienceComputer Science (R0)