Abstract
In view of the model of bird flocking, particle swarm optimization (PSO) is a promising method to tackle optimization. This study benefits from the fact that the living environment affects behaviors of the bird flocking. That is, a well-defined environmental factor can improve the performance of PSO. Thus, the environment factor is taken into account to inspire the robust behaviors of bird flocking in depth. Specifically, it not only can carry out effective searching in limited flying space, but also can strengthen the social behaviors of individual. In the field of clustering, it can be regarded as a search optimization issue. Like the utilization of some useful information generated in its process, environment factor is considered and environment factor-inspired PSO (EPSO) is proposed in this study. To take full advantage of EPSO for solving issue of clustering, we divide its process into two stages. In the first stage, the environment factor is imported as a refined search technology to achieve the multi-local optimums with high probability. In the second stage, the manifold information, i.e., individual, swarm and environment factors, is considered to improve its global search capacity. Such an approach can effectively overcome the defect of PSO being prone to being trapped in local optima. To demonstrate the validity of our approach, EPSO, conventional PSO, genetic algorithm, \(K\)-means, artificial bee colony and hybrid ABC are compared with benchmark document collections. The experiment results indicate that EPSO performs better than these state-of-the-art clustering algorithms in most cases.
Similar content being viewed by others
References
Altingovde IS, Subakan ON, Ulusoy O (2013) Cluster searching strategies for collaborative recommendation systems. Inf Process Manag 49:688–697
Anaya AR, Boticario JG (2011) Application of machine learning techniques to analyse student interactions and improve the collaboration process. Expert Syst Appl 38:1171–1181
Brits R, Engelbrecht AP, Van den Bergh F (2002) A niching particle swarm optimize. In: Proceedings of the 4th Asia-Pacific conference on simulated evolution and learning, vol 2, pp 692–696
Carlisle A, Dozier G (2001) An off-the-shelf PSO. Proceedings of the workshop on particle swarm optimization 1:1–6
Carmona J (2012) Projection approaches to process mining using region-based techniques. Data Min Knowl Disc 24(1):218–246
Chen T, Zhang NL, Liu TF, Poon KM, Wang Y (2012) Model-based multidimensional clustering of categorical data. Artif Intell 176(1):2246–2269
Cui X, Potok TE, Palathingal P (2005) Document clustering using particle swarm optimization. In: Proceeding of IEEE international conference on awarm intelligence symposium, pp 185–191
Eberhart, RC, Shi Y (2000) Comparing inertia weights and constriction factors in particle swarm optimization. In: Proceedings of the 2000 congress on evolutionary computation, pp 84–88
Engelbrecht A, Masiye B, Pampard G (2005) Niching ability of basic particle swarm optimization algorithms. In: Proceedings of IEEE internetional conference on swarm intelligence symposium, pp 397–400
Guha S, Rastogi R, Shim K (2000) Rock: a robust clustering algorithm for categorical attributes. Inform Syst 25(5):345–366
Hartigan JA (1975) Clustering algorithms. Wiley series in probability and mathematical statistics, Wiley, New York
Jain AK, Murty MN, Flynn PJ (1999) Data clustering: a review. ACM Comput Surv 31(3):264–323
Karaboga D, Ozturk C (2011) A novel clustering approach: artificial bee colony (ABC) algorithm. Appl Soft Comput 11(1):652–657
Kennedy JF, Eberhart RC, Shi Y (2001) Swarm intelligence. The Morgan Kaufmann series in evolutionary computation, Morgan Kaufmann, San Francisco
Kennedy J, Eberhart R (1995) Particle swarm optimization. In: Proceedings of IEEE international conference on neural networks, vol 4, pp 1942–1948
Koontz WL, Narendra PM, Fukunaga K (1976) A graph-theoretic approach to nonparametric cluster analysis. IEEE T Comput 100(9):936–944
Lai HP, Visani M, Boucher A, Ogier JM (2014) A new interactive semi-supervised clustering model for large image database indexing. Pattern Recogn Lett 37:94–106
Lewis D (1987) Reuters-21578. Test Collections
Li XD (2010) Niching without niching parameters: particle swarm optimization using a ring topology. IEEE T Evolut Comput 14(1):150–169
Liao YF, Yau DH, Chen CL (2012) Evolutionary algorithm to traveling salesman problems. Comput Math Appl 64(5):788–797
Liu YG, Wu XD, Shen YD (2011) Automatic clustering using genetic algorithms. Appl Math Comput 218(4):1267–1279
Manning CD, Raghavan P, Schütze H (2008) Introduction to information retrieval. Cambridge University Press, New York, pp 327–331
Maulik U, Bandyopadhyay S (2000) Genetic algorithm-based clustering technique. Pattern Recogn 33(9):1455–1465
Pang GS, Jiang SY (2013) A generalized cluster centroid based classifier for text categorization. Inform Process Manag 49(2):576–586
Porter MF (2006) An algorithm for suffix stripping. Progr Electr Lib 40(3):211–218
Qu BY, Suganthan PN, Das S (2013) A distance-based locally informed particle swarm model for multimodal optimization. IEEE T Evolut Comput 17(3):387–402
Raghavan VV, Birchard K (1979) A clustering strategy based on a formalism of the reproductive process in natural systems. ACM SIGIR Forum 14(2):10–22
Rana S, Jasola S, Kumar R (2011) A review on particle swarm optimization algorithms and their applications to data clustering. Artif Intell Rev 35(3):211–222
Rennie J (2001) Original 20 Newsgroups data set
Salton G, Buckley C (1988) Term-weighting approaches in automatic text retrieval. Inform Process Manag 24(5):513–523
Shi Y, Eberhart R (1998) A modified particle swarm optimizer. In: The 1998 IEEE international conference on evolutionary computation proceedings, pp 69–73
Steinbach M, Karypis G, Kumar V (2000) A comparison of document clustering techniques. In: Proceedings of KDD workshop on text mining, vol 400, pp 525–526
Tan KS, Isa NAM (2011) Color image segmentation using histogram thresholding: fuzzy C-means hybrid approach. Pattern Recogn 44(1):1–15
Van der Merwe DA, Engelbrecht AP (2003) The 2003 Congress on Evolutionary Computation. Data clustering using particle swarm optimization 1:215–220
Vizine AL, de Castro LN, Hruschka ER, Gudwin RR (2005) Towards improving clustering ants: an adaptive ant clustering algorithm. Informatica 29(2):143–154
Voorhees EM (1986) Implementing agglomerative hierarchic clustering algorithms for use in document retrieval. Inform Process Manag 22(6):465–476
Wolfe JH (1970) Pattern clustering by multivariate mixture analysis. Multivar Behav Res 5(3):329–350
Xia HX, Wang SG, Yoshida T (2006) A modified ant-based text clustering algorithm with semantic similarity measure. J Syst Sci Syst Eng 15(4):474–492
Yan XH, Zhu YL, Zou WP, Wang L (2012) A new approach for data clustering using hybrid artificial bee colony algorithm. Neurocomputing 97:241–250
Zhao Y, Karypis G (2004) Empirical and theoretical comparisons of selected criterion functions for document clustering. Mach Learn 55(3):311–331
Acknowledgments
The authors thank the Editors and reviewers for providing very helpful comments and suggestions. Their insight and comments led to a better presentation of the ideas expressed in this paper. This work was sponsored by National Natural Science Foundation of China (61103129, 61170121), the fourth stage of Brain Korea 21 Project. Natural Science Foundation of Jiangsu Province (SBK201122266), SRF for ROCS, SEM, and the Specialized Research Fund for the Doctoral Program of Higher Education (20100093120004).
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by C.-S. Lee.
Rights and permissions
About this article
Cite this article
Song, W., Ma, W. & Qiao, Y. Particle swarm optimization algorithm with environmental factors for clustering analysis. Soft Comput 21, 283–293 (2017). https://doi.org/10.1007/s00500-014-1458-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-014-1458-7