Abstract
Clustering and classification-based pattern recognition techniques are widely used in various domains. While many approaches exist to perform these tasks, it remains a difficult process to perform clustering and classification simultaneously for particular datasets. In this paper, a method based on k-nearest neighbor (KNN) is presented to classify the dataset with PSO optimized k-medoids clustering. Initial clustering with k-medoids algorithm divides the dataset into smaller and disjoint clusters featuring similarity within clusters and dissimilarity with members of other clusters. Particle swarm optimization (PSO) is an evolutionary algorithm mainly used to optimize the issues in several research areas including data analytics. The fitness function of the PSO approach mentioned in this paper is based on inertia weights that identify the particles with the best positions and velocities for optimization. Additionally, PSO uses a novel elitism concept that allows massive searching capability between the swarm of particles to achieve a better convergence rate. Because of this property, it can be applied throughout different machine learning fields. Moreover, the KNN classifier outperforms the classification task in terms of classifying the optimized particles with high accuracy. The performance of the proposed technique is evaluated by experimenting with datasets taken from open sources. The simulation results revealed that the performance of the proposed method is better than the existing methods in terms of effective clustering as well as accurate classification.
Similar content being viewed by others
References
Aburomman AA, Reaz MBI (2016) A novel SVM-kNN-PSO ensemble method for intrusion detection system. Appl Soft Comput 38:360–372
Alswaitti M, Albughdadi M, Isa NAM (2018) Density-based particle swarm optimization algorithm for data clustering. Expert Syst Appl 91:170–186
Amorèse D, Bossu R, Mazet-Roux G (2015) Automatic clustering of macroseismic intensity data points from internet questionnaires: Efficiency of the partitioning around medoids (PAM). Seismol Res Lett 86(4):1171–1177
Arumugam M, Raes J, Pelletier E, Le Paslier D, Yamada T, Mende DR, Fernandes GR, Tap J, Bruls T, Batto JM, Bertalan M (2011) Enterotypes of the human gut microbiome. Nature 473(7346):174–180
Bishop CM (2006) Pattern recognition and machine learning. Springer
Boro D, Bhattacharyya DK (2015) Particle swarm optimisation-based KNN for improving KNN and ensemble classification performance. Int J Innovative Comput Appl 6(3–4):145–162
Cover T, Hart P (1967) Nearest neighbor pattern classification. IEEE Trans Inf Theory 13(1):21–27
De Falco I, Della Cioppa A, Tarantino E (2005) Evaluation of particle swarm optimization effectiveness in classification. In: International Workshop on Fuzzy Logic and Applications, vol 3849. Springer, pp 164–171
Del Ser J, Villar E, Osaba E (2019) Swarm Intelligence-Recent Advances. New Perspectives and Applications
Duda H, Hart P, David G (2001) Stork, pattern classification. Wiley
Eberhat R, Kennedy J (1995) A new optimizer using particle swarm theory. In Sixth international symposium on micro machine and human science
Gallego AJ, Calvo-Zaragoza J, Valero-Mas JJ, Rico-Juan JR (2018) Clustering-based k-nearest neighbor classification for large-scale data with neural codes representation. Pattern Recogn 74:531–543
Guan C, Yuen KKF, Coenen F (2019) Particle swarm optimized density-based clustering and classification: supervised and unsupervised learning approaches. Swarm Evol Comput 44:876–896
Han J, Kamber M, Tung AK (2001) Spatial clustering methods in data mining. Geographic data mining and knowledge discovery, pp 188–217
Huang KY (2011) A hybrid particle swarm optimization approach for clustering and classification of datasets. Knowl-Based Syst 24(3):420–426
Hsu CW, Lin CJ (2002) A comparison of methods for multiclass support vector machines. IEEE Trans Neural Netw 13(2):415–425
Islam MZ, Estivill-Castro V, Rahman MA, Bossomaier T (2018) Combining K-Means and a genetic algorithm through a novel arrangement of genetic operators for high quality clustering. Expert Syst Appl 91:402–417
Kaufmann L, Rousseeuw PJ (1990) Finding groups in data: an introduction to cluster analysis. Wiley
MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, vol 1, no 14, pp 281–297
Mitchell TM (1997) Machine learning
Montero P, Vilar JA (2014) TSclust: An R package for time series clustering. J Stat Softw 62(1):1–43
Nayak SK, Rout PK, Jagadev AK, Patnaik S (2017) A modified differential evolution-based fuzzy multi-objective approach for clustering. Int J Manag Decis Mak 16(1):24–49
Ohnishi Y, Huber W, Tsumura A, Kang M, Xenopoulos P, Kurimoto K, Oleś AK, Araúzo-Bravo MJ, Saitou M, Hadjantonakis AK, Hiiragi T (2014) Cell-to-cell expression variability followed by signal reinforcement progressively segregates early mouse lineages. Nat Cell Biol 16(1):27–37
Shi Y (2001) Particle swarm optimization: developments, applications and resources. In: Proceedings of the 2001 congress on evolutionary computation, vol 1. IEEE, pp 81–86
Van den Bergh F, Engelbrecht AP (2004) A cooperative approach to particle swarm optimization. IEEE Trans Evol Comput 8(3):225–239
Wu X, Zhu X, Wu GQ, Ding W (2013) Data mining with big data. IEEE Trans Knowl Data Eng 26(1):97–107
Xiong H, Qiu B, Liu J (2020) An improved multi-swarm particle swarm optimizer for optimizing the electric field distribution of multichannel transcranial magnetic stimulation. Artificial Intel Med 104:101790. https://doi.org/10.1016/j.artmed.2020.101790
Yu D, Liu G, Guo M, Liu X (2018) An improved K-medoids algorithm based on step increasing and optimizing medoids. Expert Syst Appl 92:464–473
Zeng N, Zhang H, Liu W, Liang J, Alsaadi FE (2017) A switching delayed PSO optimized extreme learning machine for short-term load forecasting. Neurocomputing 240:175–182
Zhang X, Wang W, Li Y, Jiao LC (2012) PSO-based automatic relevance determination and feature selection system for hyperspectral image classification. Electron Lett 48(20):1263–1265
Zhang Y, Liu X, Bao F, Chi J, Zhang C, Liu P (2020) Particle swarm optimization with adaptive learning strategy. Knowl-Based Syst 196:105789. https://doi.org/10.1016/j.knosys.2020.105789
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Murugan, T.M., Baburaj, E. Clustering and classification with inertia weight and elitism-based particle swarm optimization. Pattern Anal Applic 24, 1605–1621 (2021). https://doi.org/10.1007/s10044-021-01010-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10044-021-01010-x