Abstract
Data clustering is a popular machine learning technique widely applied for data structure analysis. Among the major techniques, fuzzy co-clustering (FCoC) is known for the capability of complex data processing such as many features, large size, and uncertainty. In some cases, FCoC has demonstrated superior performance over some traditional clustering methods. However, FCoC has some limitations such as being sensitive to the initial cluster centers and being stuck at locally optimal values. Particle swarm optimization (PSO) is a multidisciplinary optimization algorithm. PSO was used to find a suitable initial cluster centers solution for the clustering algorithms. However, PSO is limited and unclear, especially for algorithms with complex structures and many parameters. In this paper, we propose a new fuzzy co-clustering algorithm by using the optimal solution of the PSO algorithm to find the initial centroids for the FCoC algorithm. We call the new algorithm FCOCM. In our experience, we use symbols and mathematical language to describe the process of finding initial centroids called the optimal centroids model (OCM). Thus, our work proposes two new contributions including the OCM model and the FCOCM algorithm. Experiments were conducted on benchmark data sets from the UCI Machine Learning Repository and the School of the Computing University of Eastern Finland to demonstrate the superior performance of the proposed method.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Kennedy J, Eberhart R (1995) Particle swarm optimization. IEEE Int Conf Neural Netw 4:1942–1948
Zeng N, Wang Z, Liu W, Zhang H, Hone K, Liu (2020) X A dynamic neighborhood-based switching particle swarm optimization algorithm, IEEE Transactions on Cybernetics, pp. 1-12
Zeng N, Song D, Li H, You Y, Liu Y, Alsaadic FE (2021) A competitive mechanism integrated multi-objective whale optimization algorithm with differential evolution. Neurocomputing 432:170–182
Luo X, Yuan Y, Chen S, Zeng N, Wang Z (2020) Position-transitional particle swarm optimization-incorporated latent factor analysis, IEEE Transactions on Knowledge and Data Engineering, pp. 1-13,
Rayala SS, Kumar NA (2020) Particle swarm optimization for robot target tracking application, Mater Today: Proc,
Song Y, Zhang F, Liu C (2020) The risk of block chain financial market based on particle swarm optimization. J Computat Appl Math 37015:112667
Gao W, Su C (2020) Analysis of earnings forecast of blockchain financial products based on particle swarm optimization. J Comput Appl Math 372:112724
Li X, Wu X, Xu S, Qing S, Chang P (2019) A novel complex network community detection approach using discrete particle swarm optimization with particle diversity and mutation. Appl Soft Comput 81:105476
Xiong H, Qiu B, Liu J (2020) An improved multi-swarm particle swarm optimizer for optimizing the electric field distribution of multichannel transcranial magnetic stimulation. Artif Intell Med 104:101790
Junior FEF, Yen GG (2019) Particle swarm optimization of deep neural networks architectures for image classification. Swarm Evolut Comput 49:62–74
Farshi TR, Drake JH, Ozcan E (2020) A multimodal particle swarm optimization-based approach for image segmentation. Expert Syst Appl 1491:113233
Janani R, Vijayarani S (2019) Text document clustering using Spectral Clustering algorithm with Particle Swarm Optimization. Expert Syst Appl 13415:192–200
Cui Z, Zhang J, Di W, Cai X, Chen J (2020) Hybrid many-objective particle swarm optimization algorithm for green coal production problem. Inform Sci 518:256–271
Hussain MM, Fujimoto N (2020) GPU-based parallel multi-objective particle swarm optimization for large swarms and high dimensional problems. Parallel Comput 92:102589
Zhou Y, Zhao Z, Cheng D (2020) Cluster structure prediction via revised particle-swarm optimization algorithm. Comput Phys Commun 247:106945
Li Y, Chu X, Tian D, Feng J, Mu W (2021) Customer segmentation using K-means clustering and the adaptive particle swarm optimization algorithm. Appl Soft Comput 113:107924
Feng Z, Niu W, Zhang R, Wang S, Chenge C (2019) Operation rule derivation of hydropower reservoir by k-means clustering method and extreme learning machine based on particle swarm optimization. J Hydrol 576:229–238
Silva TM, Pimentel BA, Souza RMCR, Oliveira AL (2015) Hybrid methods for fuzzy clustering based on fuzzy c-means and improved particle swarm optimization. Expert Syst Appl 42(17–18):6315–6328
Wu ZH, Wu ZC, Zhang J (2017) An improved FCM algorithm with adaptive weights based on SA-PSO. Neural Comput Appl 28(10):3113–3118
Sharma M, Chhabra JK (2019) Sustainable automatic data clustering using hybrid PSO algorithm with mutation. Sustain Comput: Inf Syst 23:144–157
Li C, Zhou J, Kou P, Xiao J (2012) A novel chaotic particle swarm optimization based fuzzy clustering algorithm. Neurocomputing 83:98–109
Gupta R, Muttoo SK, Pal SK (2017) Fuzzy C-Means clustering and particle swarm optimization based scheme for common service center location allocation. Appl Intell 47(3):624–643
Song Y, Pan S, Liu S, Wei F, Zhou MX, W Qian (2010) Association for the advancement of artificial intelligence, Constrained co-clustering for textual documents, pp 581–586
Selosse M, Jacques J, Biernacki C (2020) Model-based co-clustering for mixed type data. Comput Stat Data Anal 144:1–18
Kummamuru K, Dhawale A, Krishnapuram R (2003) Fuzzy co-clustering of documents and keywords. IEEE Int Conf on Fuzzy Syst 2:772–777
Tjhi WC, Chen L (2007) Possibilistic fuzzy co-clustering of large document collections. Pattern Recognit 40(12):3452–3466
Yan Y, Chen L, Tjhi WC (2013) Fuzzy semi-supervised co-clustering for text documents. Fuzzy Sets Syst 215:74–89
Hanmandlua M, Verma OP, Susan S, Madasu V (2013) Color segmentation by fuzzy co-clustering of chrominance color features. Neurocomputing 120:235–249
Tjhi WC, Chen L (2008) A heuristic-based fuzzy co-clustering algorithm for categorization of high-dimensional data. Fuzzy Sets Syst 159:371–389
Pham VN, Ngo LT, Pedrycz W (2016) Interval-valued fuzzy set approach to fuzzy co-clustering for data classification. Knowl-Based Syst 107:1–13
Nha PV, Long PT, Pedryczc W, Long NT (2021) Feature-reduction fuzzy co-clustering approach for hyper-spectral image analysis. Knowl-Based Syst 216:106549
Mazur JE (2006) Mathematical models and the experimental analysis of behavior. J Exp Anal Behav 85(2):275–291
Hwang C, Rhee FCH (2007) Uncertain fuzzy clustering: interval type-2 fuzzy approach to C-means. IEEE Trans Fuzzy Syst 15(1):107–120
Xing H, He H, Hu D, Jiang T, Yu X (2019) An interval Type-2 fuzzy sets generation method for remote sensing imagery classification. Comput Geosci 133:104287
Olson DL, Delen D (2008) Advanced data mining techniques, Springer ISBN 3-540-76916-1, 1st edition, page 138,
David MWP (2011) Evaluation: from Precision. Recall, F-Measure ROC, Inf, Mark Correl, Mach Learn Technol 2(1):37–63
Jiang M, Luo YP, Yang SY (2007) Stochastic convergence analysis and parameter selection of the standard particle swarm optimization algorithm. Inf Process Lett 102(115):8–16
Piotrowski AP, Napiorkowski JJ, Piotrowska AE (2020) Population size in particle swarm optimization. Swarm Evolut Comput 58:100718
FrÁ nti P, Virmajoki O, Hautamáki V (2006) Fast agglomerative clustering using a k-nearest neighbor graph. IEEE Trans Pattern Analy Mach Intell 28(11):1875–1881
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Cam Binh, L.T., Nha, P.V. Optimal centroids model approach for many-feature data structure prediction. Evol. Intel. 16, 1353–1367 (2023). https://doi.org/10.1007/s12065-022-00747-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12065-022-00747-6