Abstract
K-means is a widely used classical clustering algorithm in pattern recognition, image segmentation, bioinformatics and document clustering. But, it is easy to fall into local optimum and is sensitive to the initial choice of cluster centers. As a remedy, a popular trend is to integrate the swarm intelligence algorithm with K-means to obtain hybrid swarm intelligence K-means algorithms. Such as, K-means is combined with particle swarm optimization algorithm to obtain particle swarm optimization K-means, and is also combined with genetic mechanism to obtain Genetic K-means algorithms. The classical K-means clustering algorithm requires the number of cluster centers in advance. In this paper, an automatic quantum genetic K-means algorithm for unknown K (AQGUK) is proposed to accelerate the convergence speed and improve the global convergence of AGUK. In AQGUK, a Q-bit based representation is employed for exploration and exploitation in discrete 0–1 hyperspace using rotation operation of quantum gate as well as the genetic operations (selection, crossover and mutation) of Q-bits. The length of Q-bit individuals is variable during the evolution, which is Different from the typical genetic algorithms. Without knowing the exact number of clusters beforehand, AQGUK can obtain the optimal number of clusters and provide the optimal cluster centroids. Five gene datasets are used to validate AQGUK, AGUK and K-means. The experimental results show that AQGUK is effective and promising.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Steinhaus, H.: Sur la division des corp materiels en parties. Bull Acad Polon Sci. 3, 801–804 (1956)
Macqueen, J.: Some methods for classification and analysis of multivariate observations. In Proceedings of the 5th Berkeley Symposium on Mathematical Statistics and Probability, pp. 281–297 (1967)
Ishak Boushaki, S., Kamel, N.: A new quantum chaotic cuckoo search algorithm for data clustering. Expert Syst. Appl. 96, 358–372 (2018)
Xiao, J., Yan, Y.P.: A quantum-inspired genetic algorithm for k-means clustering. Expert Syst. Appl. 37, 4966–4973 (2010)
Krishna, K., Murty, M.N.: Genetic k-means algorithm. IEEE Trans. Syst. Man Cybern. 29, 433–439 (1999)
Du, Z., Wang, Y.: PK-means: a new algorithm for gene clustering. Comput. Biol. Chem. 32, 243–247 (2008)
Hey, T.: Quantum computing: an introduction. Comput. Control Eng. J. 10(3), 105–112 (1999)
Han, K.H., Kim, J.H.: Quantum-inspired evolutionary algorithm for a class of combinatorial optimization. IEEE Trans. Evol. Comput. 6(6), 580–593 (2002)
Xie, X.L., Beni, G.: A validity measure for fuzzy clustering. IEEE Trans. Pattern Anal. Mach. Intell. 13, 841–847 (1991)
Liu, Y.G.: Automatic clustering using genetic algorithms. Appl. Math. Comput. 218, 1267–1279 (2011)
Chu, S., DeRisi, J.: The transcriptional program of sporulation in budding yeast. Science 282, 699–705 (1998)
Spellman, P.T.: Comprehensive identification of cell cycle-regulated genes of the yeast saccharomyces cerevisiae by microarray hybridization. Mol. Biol. 9, 3273–3297 (1998)
Alizadeh, A.A., Eisen, M.B.: Distinct types of diffuse large b-cell lymphoma identified by gene expression profiling. Nature 403, 503–511 (2000)
Yoon, D., Lee, E.K.: Robust imputation method for missing values in microarray data. BMC Bioinform. 8, 6–12 (2007)
Troyanskaya, O., Cantor, M.: Missing value estimation methods for DNA microarrays. Bioinformatics 17, 520–525 (2001)
Chun, H., Feng, L.: A genetic xk-means algorithm with empty cluster reassignment. Symmetry 11(744), 49–65 (2019)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Hua, C., Liu, N. (2020). A Quantum-Inspired Genetic K-Means Algorithm for Gene Clustering. In: Han, M., Qin, S., Zhang, N. (eds) Advances in Neural Networks – ISNN 2020. ISNN 2020. Lecture Notes in Computer Science(), vol 12557. Springer, Cham. https://doi.org/10.1007/978-3-030-64221-1_2
Download citation
DOI: https://doi.org/10.1007/978-3-030-64221-1_2
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-64220-4
Online ISBN: 978-3-030-64221-1
eBook Packages: Computer ScienceComputer Science (R0)