Abstract
This paper proposes a novel kernel clustering algorithm using a hybrid memetic algorithm for clustering complex, unlabeled, and linearly non-separable datasets. The kernel function can transform nonlinear data into a high dimensional feature space. It increases the probability of the linear separability of the patterns within the transformed space and simplifies the associated data structure. According to the distribution of various datasets, three local learning operators are designed; meanwhile double mutation operators incorporated into local learning operators to further enhance the ability of global exploration and overcome premature convergence effectively. The performance comparisons of the proposed method with k-means, kernel k-means, global kernel k-means and spectral clustering algorithms on artificial datasets and UCI datasets indicate that the proposed clustering algorithm outperforms the compared algorithms.
Similar content being viewed by others
References
Åberg MB, Wessberg J (2010) A memetic algorithm for selection of 3D clustered features with applications in neuroscience. Paper presented at the International Conference on Pattern Recognition, Istanbul 2010. pp 1076–1079
Chellapilla K (1998) Combining mutation operators in evolutionary programming. IEEE Trans Evol Comput 2(3):91–96
Chen C, Low CP, Yang Z (2009) Preserving and exploiting genetic diversity in evolutionary programming algorithms. IEEE Trans Evol Comput 13(3):661–673
Das S, Abraham A, Konar A (2008) Automatic kernel clustering with a multi-elitist particle swarm optimization algorithm. Pattern Recogn Lett 29(5):688–699
Das S, Abraham A, Chakraborty UK, Konar A (2009) Differential evolution using a neighborhood based mutation operator. IEEE Trans Evol Comput 13(3):526–553
Ding C, He XF, Simon HD (2005) On the equivalence of nonnegative matrix factorization and spectral clustering. Paper presented at the SIAM International Conference on Data Mining, Newport Beach 2005. pp 606–610
Dong H, He J, Huang H, Hou W (2007) Evolutionary programming using a mixed mutation strategy. Inf Sci 177(1):312–327
Filippone M, Camastra F, Masulli F, Rovetta S (2008) A survey of kernel and spectral methods for clustering. Pattern Recogn 41:176–190
Geng X, Zhan DC, Zhou ZH (2005) Supervised nonlinear dimensionality reduction for visualization and classification. IEEE Trans Syst Man Cybern B 35(6):1098–1107
Guan NY, Tao DC, Luo ZG, Yuan B (2011) Non-negative patch alignment framework. IEEE Trans Neural Netw 22(8):1218–1230
Handl J, Knowles J (2007) An evolutionary approach to multiobjective clustering. IEEE Trans Evol Comput 11(1):56–76
Hartigan JA, Wong MA (1979) A k-means clustering algorithm. Applied Statistics 28:100–108
Kim J, Park H (2008) Toward Faster Nonnegative Matrix Factorization: a new algorithm and comparisons. Paper presented at the 8th IEEE International Conference on Data Mining, New York, 2008
Le MN, Ong YS, Jin Y, Sendhoff B (2009) Lamarckian memetic algorithms: local optimum and connectivity structure analysis. Memetic Comput J 1(3):175–190
Lee CY, Yao X (2004) Evolutionary programming using mutations based on the Lévy probability distribution. IEEE Trans Evol Comput 8(1):1–13
Li B, Zhao SL, Fang L (2009) Optimized GM (1, 1) based on Romber algorithm and quadratic interpolation method. Paper presented at the International Conference on Apperceiving Computing and Intelligence Analysis, Oct 2009, pp 128–131
Liang KH, Yao X (2000) Evolutionary search of approximated N-dimensional landscapes. Int J Knowl Int Eng Syst 4(3):172–183
Liang KH, Yao X, Newton C (1999) Combining landscape approximation and local search in global optimization. In: Proceedings of the 1999 Congress on Evolutionary Computation, 6–9 July 1999, vol 2. IEEE Press, pp 1514–1520
Liang RS, Jiang YF, Bian R (2009) Ordered hill climbing search for heuristic planning. Paper presented at the International Conference on Information Engineering and Computer Science, Tokyo, Dec 2009, pp 1–4
Lim M, Yuan Y, Omatu S (2000) Efficient genetic algorithms using simple genes exchange local search policy for the quadratic assignment problem. Comput Optimization Appl 15(3):249–268
Moscato PA (1989) On evolution, search, optimization, genetic algorithms and martial arts: Towards memetic algorithms. Caltech Concurrent Computation Program Report 826, Caltech, Pasadena
Ng AY, Jordan M, Weiss Y (2001) On spectral clustering: analysis and an algorithm. Advances in Neural Information Processing Systems 14. MIT Press, Cambridge, pp 849–856
Scholkopf B, Smola A, Muller KR (1998) Nonlinear component analysis as a kernel eigenvalue problem. Neural Comput 10:1299–1319
Schwefel HP (1995) Evolution and optimum seeking. Wiley, New York
Sheng WG, Liu XH, Fairhurst MC (2008) A niching memetic algorithm for simultaneous clustering and feature selection. IEEE Trans Knowl Data Eng 20(7):868–879
Tzortzis GF, Likas AC (2009) The global kernel k-means algorithm for clustering in feature space. IEEE Trans Neural Netw 20(7):1181–1194
Wang Z, Liu JL (2009) Hybrid memetic algorithm for uniting classes of university timetabling problem. Paper presented at the International Conference on Computational Intelligence and Security, Beijing, 11–14 Dec 2009, 978-0-7695-3931-7/09
Yao X, Liu Y (1996) Fast evolutionary programming. In: Proceedings of the Fifth Annual Conference on Evolutionary Programming (EP’96), 29/2-2/3/96. MIT Press, San Diego, pp 451–460
Yao X, Liu Y (1999) Evolutionary programming made faster. IEEE Trans Evol Comput 3(2):82–102
Zhang XR, Jiao LC, Liu F, Bo LF, Gong MG (2008) Spectral clustering ensemble applied to texture features for SAR image segmentation. IEEE Trans Geosci Remote Sens 46(7):2126–2136
Zhang TH, Tao DC, Li XL, Yang J (2009) Patch alignment for dimensionality reduction. IEEE Trans Knowl Data Eng 21(9):1299–1313
Zhou TY, Tao DC, Wu XD (2011) Manifold elastic net: a unified framework for sparse dimension reduction. Data Min Knowl Disc 22(3):340–371
Acknowledgments
This work was supported by the Program for New Century Excellent Talents in University (No. NCET-12-0920), the National Natural Science Foundation of China (Nos. 61272279, 61001202 and 61203303), the China Postdoctoral Science Foundation Funded Project (Nos. 20080431228, 20090461283 and 20090451369), the China Postdoctoral Science Foundation Special Funded Project (Nos. 200801426 and 201104618), the Fundamental Research Funds for the Central Universities (Nos. K50511020014, K50511020011 and K50510020011), the Provincial Natural Science Foundation of Shaanxi of China (No. 2011JQ8020) and the Fund for Foreign Scholars in University Research and Teaching Programs (the 111 Project) (No. B07048).
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Li, Y., Li, P., Wu, B. et al. Kernel clustering using a hybrid memetic algorithm. Nat Comput 12, 605–615 (2013). https://doi.org/10.1007/s11047-013-9365-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11047-013-9365-x