Abstract
Clustering has been used as a popular technique for identifying a natural grouping or meaningful partition of a given data set by using a distance or similarity function. This paper proposes a novel real coded Genetic algorithm (GA) for the development of optimal Gustafson Kessel (GK) clustering algorithm. In this work, the objective function of the GK algorithm is optimized using real coded genetic algorithm. The cluster centers are represented as real numbers and real-parameter genetic operators are applied to obtain the optimal cluster centers that minimize the intra-cluster distance. The performance of the proposed approach is demonstrated through three gene expression data sets. Xie-Beni index is used to arrive at the best possible number of clusters. The proposed method has produced the objective function value which is less than the value obtained using K-Means, Fuzzy C-Means and GK algorithms. Statistical analysis of the test results shows the superiority of the proposed algorithm over the existing methods.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Babuska, R.: Fuzzy Modeling for Control. Kluwer Academic Publishers, Norwell (1999)
Babuska, R., Van der Veen, P.J., Kaymak, U.: Improved covariance estimation for Gustafson-Kessel clustering. In: IEEE (2002)
Chandrasekhar, T., Thangavel, K., Elayaraja, E.: Effective Clustering Algorithms for Gene Expression Data. Int. J. Comput. Appl. (0975 – 9997), 32(4), (2011)
Kim, D.W., Lee, K.H., Lee, D.: Detecting clusters of different geometrical shapes in microarray gene expression data. Bioinformatics 21(9), 1927–1934 (2005)
Goldberg, David E.: Genetic Algorithms in Search, Optimization and Machine Learning. Pearson Education, New York (2011)
Jiang, D., Tang, C., Zhang, A.: Cluster analysis for gene expression data: a survey. IEEE Trans. Knowl. Data Eng. 16(11), 1370–1386 (2004)
Devaraj, D.: Improved genetic algorithm for multi-objective reactive power dispatch problem. Eur. Trans. Electr. Power 17(6), 569–581 (2007). doi:10.1002/etep.146
Falehi, A.D., Rostami, M., Doroudi, A., Ashrafian, A.: Optimization and coordination of SVC-based supplementary controllers and PSSs to improve power system stability using a genetic algorithm. Turk. J. Electr. Eng. Comput. Sci. 20(5), 639–654 (2012)
Wu, F.X., Zhang, W.J., Kusalik, A.J.: A genetic k-means clustering algorithm applied to gene expression data. In: Xiang, Y., Chaib-draa, B. (eds.) Canadian AI 2003. LNCS (LNAI), vol. 2671, pp. 520–526. Springer, Heidelberg (2003)
Ganesh Kumar, P., Devaraj, D.: Improved genetic algorithm for optimal design of fuzzy classifier. Int. J. Comput. Appl. Technol. 35(234), 97–103 (2009)
Ganesh Kumar, P., Rani, C., Devaraj, D., Aruldoss Albert Victoire, T.: Hybrid ant bee algorithm for fuzzy expert system based sample classification, IEEE/ACM Trans. Comput. Biol. Bioinf. (2013). doi. 10.1109/TCBB.2014.2307325. ISSN 1545-5963
Yi, G., Sze, S.H., Thon, M.R.: Identifying clusters of functionally related genes in genomes. Bioinformatics 23(9), 1053–1060 (2007). doi:10.1093/bioinformatics/btl673
Gibbons, F., Roth, F.: Judging the quality of gene expression-based clustering methods using gene annotation. Genome Res. 12, 1574–1591 (2002)
Manda, K., Hanuman, A.S., Satapathy, S.C., Chaganti, V., Babu, A.V.: A software tool for data clustering using particle swarm optimization. In: Panigrahi, B.K., Das, S., Suganthan, P.N., Dash, S.S. (eds.) SEMCCO 2010. LNCS, vol. 6466, pp. 278–285. Springer, Heidelberg (2010)
Piyushkumar, M.A., Rajapakse, J.C.: SVM-RFE with MRMR filter for gene selection. IEEE Transa. Nanobiosci. 9(1), 31–37 (2010)
Ravi, V., Aggarwal, N., Chauhan, N.: Differential evolution based fuzzy clustering. In: Panigrahi, B.K., Das, S., Suganthan, P.N., Dash, S.S. (eds.) SEMCCO 2010. LNCS, vol. 6466, pp. 38–45. Springer, Heidelberg (2010)
Sarmah, R.: Gene expression data clustering using a fuzzy link based approach. Int. J. Comput. Inf. Syst. Ind. Manage. Appl. 5, 532–541 (2013)
Xie, X.L., Beni, G.: A Validity Measure for Fuzzy Clustering. IEEE Trans. Pattern Anal. Mach. Intell. 13(9), 941–947 (1991)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Devi Arockia Vanitha, C., Devaraj, D., Venkatesulu, M. (2015). Real Coded Genetic Algorithm for Development of Optimal G-K Clustering Algorithm. In: Panigrahi, B., Suganthan, P., Das, S. (eds) Swarm, Evolutionary, and Memetic Computing. SEMCCO 2014. Lecture Notes in Computer Science(), vol 8947. Springer, Cham. https://doi.org/10.1007/978-3-319-20294-5_23
Download citation
DOI: https://doi.org/10.1007/978-3-319-20294-5_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-20293-8
Online ISBN: 978-3-319-20294-5
eBook Packages: Computer ScienceComputer Science (R0)