Abstract
Clustering is a standard paradigm for associating some similar objects to a cluster while associating others to their respective clusters in an unsupervised manner. Different techniques are proposed in the domain of clustering depending upon the underlying methodology slightly vary from one another. Some of these are regarded as Hierarchical Clustering, K-means Clustering, Spectral Clustering, Affinity Propagation and Density based spatial Clustering. Their strengths and weaknesses are different for different datasets. In this research, we have used three benchmark datasets to analyze the merits and demerits for each of the clustering techniques. After applying these algorithms to the datasets named letter, glass and wine, we presented the critical review on why a specific clustering algorithm lacks with respect to execution time and clustering quality measured by Silhouette’s coefficient and why an algorithm performs better than others on the same dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Verma, M.: Artificial intelligence and its scope in different areas with special reference to the field of education. Int. J. Adv. Educ. Res. 3, 2455–6157 (2018). www.educationjournal.org
Schuh, G., et al.: Data mining definitions and applications for the management of production complexity. Procedia CIRP 81, 874–879 (2019). https://doi.org/10.1016/j.procir.2019.03.217
Soni, N., Ganatra, A.: Comparative study of several clustering algorithms. Int. J. Adv. Comput. Res. 2(6), 37–42 (2012)
Guyon, I., Von Luxburg, U., Williamson, R.C.: Clustering: science or art. In: JMLR Workshop and Conference Proceedings, vol. 27, pp. 65–79 (2012). https://www.informatik.uni-hamburg.de/ML/contents/people/luxburg/workshops/ClusteringScienceOrArt09/opinions/opinion-artorscience.pdf
Lloyd, S.P.: Least squares quantization in PCM. IEEE Trans. Inf. Theor. 28(2), 129–137 (1982). https://doi.org/10.1109/TIT.1982.1056489
Kanungo, T., et al.: Efficient k-means clustering algorithm: analysis and implementation. IEEE Trans. Patt. Anal. Mach. Intell. 24(7), 881–892 (2002)
Jahangoshai Rezaee, M., Eshkevari, M., Saberi, M., Hussain, O.: GBK-means clustering algorithm: an improvement to the K-means algorithm based on the bargaining game. Knowl.-Based. Syst. 213, 106672 (2021). https://doi.org/10.1016/j.knosys.2020.106672
Xu, X., Ding, S., Wang, Y., Wang, L., Jia, W.: A fast density peaks clustering algorithm with sparse search. Inf. Sci. (Ny) 554, 61–83 (2021). https://doi.org/10.1016/j.ins.2020.11.050
Ishizaka, A., Lokman, B., Tasiou, M.: A stochastic multi-criteria divisive hierarchical clustering algorithm. Omega (United Kingdom) 103, 102370 (2021). https://doi.org/10.1016/j.omega.2020.102370
Shi, N., Liu, X., Guan, Y.: Research on k-means clustering algorithm: an improved k-means clustering algorithm. In: 3rd International Symposium on Intelligent Information Technology and Security Informatics, IITSI 2010, pp. 63–67 (2010).https://doi.org/10.1109/IITSI.2010.74
Sharma, K.K., Seal, A., Herrera-Viedma, E., Krejcar, O.: An enhanced spectral clustering algorithm with s-distance. Symmetry (Basel) 13(4), 1–17 (2021). https://doi.org/10.3390/sym13040596
Sinaga, K.P., Yang, M.S.: Unsupervised K-means clustering algorithm. IEEE Access 8, 80716–80727 (2020). https://doi.org/10.1109/ACCESS.2020.2988796
Dunn, J.C.: A fuzzy relative of the ISODATA process and its use in detecting compact well-separated clusters. J. Cybern. 3(3), 32–57 (2008)
Pal, N.R., Biswas, J.: Cluster validation using graph theoretic concepts. Pattern Recognit. 30(6), 847–857 (1997). https://doi.org/10.1016/S0031-3203(96)00127-6
Ilc, N.: Modified Dunn’s cluster validity index based on graph theory. Prz. Elektrotechniczny (Elect. Rev.) 88(2), 126–131 (2012)
Tibshirani, R., Walther, G., Hastie, T.: Estimating the number of clusters in a data set via the gap statistic. J. R. Stat. Soc. Ser. B Stat. Methodol. 63(2), 411–423 (2001). https://doi.org/10.1111/1467-9868.00293
Davies, D.L., Bouldin, D.W.: A cluster separation measure. IEEE Trans. Pattern Anal. Mach. Intell. (2), 224–227 (1979)
Rousseeuw, P.J.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20(C), 53–65 (1987). https://doi.org/10.1016/0377-0427(87)90125-7
Caliński, T., Harabasz, J.: A dendrite method for cluster analysis. Commun. Stat. 3(1), 1–27 (1974)
Müllner, D.: Modern hierarchical, agglomerative clustering algorithms, no. 1973, pp. 1–29 (2011). http://arxiv.org/abs/1109.2378
Wang, K.J., Zhang, J.Y., Li, D., Zhang, X.N., Guo, T.: Adaptive affinity propagation clustering. Zidonghua Xuebao/Acta Autom. Sin. 33(12), 1242–1246 (2007). https://doi.org/10.1360/aas-007-1242
Baby, P., Sasirekha, K.: Agglomerative hierarchical clustering algorithm- a review. Int. J. Sci. Res. Publ. 3(3), 2–4 (2013)
Dueck, D.: Affinity propagation: clustering data by passing messages (2009)
Von Luxburg, U.: A tutorial on spectral clustering. Stat. Comput. 17(4), 395–416 (2007). https://doi.org/10.1007/s11222-007-9033-z
Fong, S., Rehman, S.U., Aziz, K., Science, I.: DBSCAN : Past, Present and Future, pp. 232–238 (2014)
Pant, M., Radha, T., Singh, V.P.: Particle swarm optimization using Gaussian inertia weight. In: Proceedings of International Conference on Computational Intelligence and Multimedia Applications, ICCIMA 2007, vol. 1, pp. 97–102 (2008).https://doi.org/10.1109/ICCIMA.2007.328
Acknowledgement
This research was supported by Universiti Tun Hussein Onn Malaysia (UTHM) through Tier 1 vot. H938.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Amna, Nawi, N.M., Aamir, M., Mushtaq, M.F. (2022). The Comparative Performance Analysis of Clustering Algorithms. In: Ghazali, R., Mohd Nawi, N., Deris, M.M., Abawajy, J.H., Arbaiy, N. (eds) Recent Advances in Soft Computing and Data Mining. SCDM 2022. Lecture Notes in Networks and Systems, vol 457. Springer, Cham. https://doi.org/10.1007/978-3-031-00828-3_34
Download citation
DOI: https://doi.org/10.1007/978-3-031-00828-3_34
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-00827-6
Online ISBN: 978-3-031-00828-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)