Abstract
Clustering is a technique of grouping the data objects into clusters. Many metaheuristic algorithms based on swarm intelligence, physic laws, and chemical reactions, among others, have been developed for clustering. In this study, an enhanced whale optimization algorithm (EWOA) is introduced to solve clustering problems. The whale optimization algorithm (WOA) is adapted and enhanced with two additional operational procedures. The position update equations from the water wave optimization algorithm are incorporated into the algorithm to improve the search space and accelerate the convergence rate. The tabu and neighbourhood search mechanisms were added to handle the local optima situation. The efficiency of the proposed EWOA is measured using a simulation-based experiment conducted on eight benchmark datasets, and the results obtained are then compared to seven existing clustering algorithms/techniques. The performance of each algorithm is compared and analyzed using the average intra-cluster distance and f-measure parameters. The experimental results demonstrated the applicability and feasibility of the enhancements that were made and proved the superiority of the proposed EWOA clustering algorithm.
Similar content being viewed by others
References
Ahmadi R, Ekbatanifard G, Bayat P (2021) A modified grey wolf optimizer based data clustering algorithm. Appl Artif Intell 35(1):63–79
Alshamiri AK, Singh A, Surampudi BR (2016) Artificial bee colony algorithm for clustering: an extreme learning approach. Soft Comput 20(8):3163–3176
Chang DX, Zhang XD, Zheng CW (2009) A genetic algorithm with gene rearrangement for K-means clustering. Pattern Recogn 42(7):1210–1222
Cura T (2012) A particle swarm optimization approach to clustering. Expert Syst Appl 39(1):1582–1588
Dorigo M, Birattari M, Stutzle T (2006) Ant colony optimization. IEEE Comput Intell Mag 1(4):28–39
Ganguly D (2018) A fast partitional clustering algorithm based on nearest neighbours heuristics. Pattern Recogn Lett 112:198–204
Ghany KKA, AbdelAziz AM, Soliman THA, Sewisy AAEM (2022) A hybrid modified step whale optimization algorithm with Tabu search for data clustering. Journal of King Saud University-Computer and Information Sciences 34(3):832–839
Goyal S, Bhushan S, Kumar Y, Rana AUHS, Bhutta MR, Ijaz MF, Son Y (2021) An optimized framework for energy-resource allocation in a cloud environment based on the whale optimization algorithm. Sensors 21(5):1583
Han J, Pei J, Kamber M (2011) Data mining: concepts and techniques. Elsevier
Han X, Quan L, Xiong X, Almeter M, Xiang J, Lan Y (2017) A novel data clustering algorithm based on modified gravitational search algorithm. Eng Appl Artif Intell 61:1–7
Hatamlou A (2013) Black hole: A new heuristic optimization approach for data clustering. Inf Sci 222:175–184
Hatamlou A (2017) A hybrid bio-inspired algorithm and its application. Appl Intell 47:1059–1067
Hatamlou A, Abdullah S, Hatamlou M (2011) Data clustering using big bang–big crunch algorithm. In: Pichappan P, Ahmadi H, Ariwa E (eds) Innovative computing technology. INCT 2011. Communications in Computer and Information Science, vol 241. Springer, Berlin, Heidelberg, pp 383–388. https://doi.org/10.1007/978-3-642-27337-7_36
Jiang B, Wang N (2014) Cooperative bare-bone particle swarm optimization for data clustering. Soft Comput 18(6):1079–1091
Kao YT, Zahara E, Kao IW (2008) A hybridized approach to data clustering. Expert Syst Appl 34(3):1754–1762
Karaboga D, Ozturk C (2011) A novel clustering approach: artificial bee colony (ABC) algorithm. Appl Soft Comput 11(1):652–657
Kumar Y, Kaur A (2021) Variants of bat algorithm for solving partitional clustering problems. Eng Comput. https://doi.org/10.1007/s00366-021-01345-3
Kumar Y, Sahoo G (2015) A hybrid data clustering approach based on improved cat swarm optimization and K-harmonic mean algorithm. AI Commun 28(4):751–764
Kumar Y, Sahoo G (2015) Hybridization of magnetic charge system search and particle swarm optimization for efficient data clustering using neighborhood search strategy. Soft Comput 19(12):3621–3645
Kumar Y, Sahoo G (2015) A two-step artificial bee colony algorithm for clustering. Neural Comput & Applic 28(3):537–551
Kushwaha N, Pant M, Kant S, Jain VK (2018) Magnetic optimization algorithm for data clustering. Pattern Recogn Lett 115:59–65
Mat AN, İnan O, Karakoyun M (2021) An application of the whale optimization algorithm with levy flight strategy for clustering of medical datasets. International Journal of Optimization and Control: Theories & Applications 11(2):216–226
Menéndez HD, Otero FE, Camacho D (2016) Medoid-based clustering using ant colony optimization. Swarm Intelligence 10(2):123–145
Mirjalili S, Lewis A (2016) The whale optimization algorithm. Adv Eng Softw 95:51–67
Motwani M, Arora N, Gupta A (2019) A study on initial centroids selection for partitional clustering algorithms. In: Hoda M, Chauhan N, Quadri S, Srivastava P (eds) Software engineering. Advances in intelligent systems and computing, vol 731. Springer, Singapore, pp 211–220. https://doi.org/10.1007/978-981-10-8848-3_21
Nanda SJ, Panda G (2014) A survey on nature inspired metaheuristic algorithms for partitional clustering. Swarm and Evolutionary computation 16:1–18
Premalatha K, Natarajan AM (2008) A new approach for data clustering based on PSO with local search. Computer and Information Science 1(4):139–145
Purushothaman R, Rajagopalan SP, Dhandapani G (2020) Hybridizing gray wolf optimization (GWO) with grasshopper optimization algorithm (GOA) for text feature selection and clustering. Appl Soft Comput 96:106651
Rahnema N, Gharehchopogh FS (2020) An improved artificial bee colony algorithm based on whale optimization algorithm for data clustering. Multimed Tools Appl 79(43):32169–32194
Santana-Velásquez, A., John Freddy Duitama, M., & Arias-Londoño, J.D. (2020). Classification of diagnosis-related groups using computational intelligence techniques. Proceedings of the 2020 IEEE Colombian Conference on Applications of Computational Intelligence (IEEE ColCACI 2020), 2020, pp. 1–6, https://doi.org/10.1109/ColCACI50549.2020.9247889.
Senthilnath J, Omkar SN, Mani V (2011) Clustering using firefly algorithm: performance study. Swarm and Evolutionary Computation 1(3):164–171
Siddiqi UF, Sait SM (2017) A new heuristic for the data clustering problem. IEEE Access 5:6801–6812
Singh H, Kumar Y (2020) Hybrid artificial chemical reaction optimization algorithm for cluster analysis. Procedia Computer Science 167:531–540
Singh H, Kumar Y (2020) A neighborhood search based cat swarm optimization algorithm for clustering problems. Evol Intel 13(4):593–609
Singh H, Kumar Y, Kumar S (2019) A new meta-heuristic algorithm based on chemical reactions for partitional clustering problems. Evol Intel 12(2):241–252
Stephan P, Stephan T, Kannan R, Abraham A (2021) A hybrid artificial bee colony with whale optimization algorithm for improved breast cancer diagnosis. Neural Comput & Applic 33:13667–13691
Stephan P, Stephan T, Gandomi AH (2022) A novel breast cancer diagnosis scheme with intelligent feature and parameter selections. Comput Methods Prog Biomed 214:106432
Wang GG, Guo L, Gandomi AH, Hao GS, Wang H (2014) Chaotic krill herd algorithm. Inf Sci 274:17–34
Wang R, Zhou Y, Qiao S, Huang K (2016) Flower pollination algorithm with bee pollinator for cluster analysis. Inf Process Lett 116(1):1–14
Yan X, Zhu Y, Zou W, Wang L (2012) A new approach for data clustering using hybrid artificial bee colony algorithm. Neurocomputing 97:241–250
Zhang C, Ouyang D, Ning J (2010) An artificial bee colony approach for clustering. Expert Syst Appl 37(7):4761–4767
Zheng YJ (2015) Water wave optimization: A new nature-inspired metaheuristic. Comput Oper Res 55:1–11
Zhou Y, Zhou Y, Luo Q, Abdel-Basset M (2017) A simplex method-based social spider optimization algorithm for clustering analysis. Eng Appl Artif Intell 64:67–82
Acknowledgments
The authors would like to thank the Editors and the anonymous reviewers for their valuable comments and suggestions which has helped to improve the quality and clarity of the paper. The authors would also like to acknowledge the assistance rendered by Dr. Cherry Bhargava for the general supervision of the research group and general administrative support.
Data Availability
The data that support the findings of this study are available upon request from the corresponding authors.
Funding
This work was supported by the Ministry of Education, Malaysia under grant no. FRGS/1/2020/STG06/UCSI/02/1.
Author information
Authors and Affiliations
Contributions
All authors contributed to the conception and design of the study. Material preparation, data collection, data visualization and data analysis were performed by Hakam Singh, Vipin Rai, Neeraj Kumar, and Pankaj Dadheech. Advanced data analysis and validation were done by Ketan Kotecha, Ganeshsree Selvachandran and Ajith Abraham. The first draft of the manuscript was written by Hakam Singh, Vipin Rai, Neeraj Kumar, and Pankaj Dadheech. The second draft was prepared and edited by Ganeshsree Selvachandran and Ajith Abraham. All authors commented on previous versions of the manuscript. All authors have read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors have no relevant financial or non-financial interests to disclose.
Ethical compliance
Authors’ declaration: This manuscript is the authors’ original work and has not been published elsewhere. All authors have checked the manuscript and have agreed to this submission.
Ethical approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Singh, H., Rai, V., Kumar, N. et al. An enhanced whale optimization algorithm for clustering. Multimed Tools Appl 82, 4599–4618 (2023). https://doi.org/10.1007/s11042-022-13453-3
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-13453-3