Abstract
Clustering techniques are aimed to partition the entire input space into disconnected sets where the members of each set are highly connected. K-harmonic means (KHM) is a well-known data clustering technique, but it runs into local optima. A two stage genetic clustering method using KHM (TSGKHM) is proposed in this research, which can automatically cluster the input data points into an appropriate number of clusters. With the best features of both the algorithm, and TSGKHM the first stage overcomes the local optima and results in optimal cluster centers, and in the second stage, results into/in optimal clusters.
The proposed method is executed on globally accepted, four real time data sets. The intermediate results are produced. The performance analysis shows that TSGKHM performs significantly better.
Chapter PDF
References
Macqueen, J.B.: Some Methods for Classification and Analysis of Multivariate Observations. In: Proceedings of 5th Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, pp. 281–297. University of California Press, Berkeley (1967)
Jiawei Han, M.K.: Data Mining Concepts and Techniques, Morgan Kaufmann Publishers, An Imprint of Elsevier (2006)
Zhang, B., Hsu, M., Dayal, U.: K-harmonic means – a data clustering algorithm. Technical Report HPL-1999-124. Hewlett-Packard Laboratories (1999)
Zhang, B., Hsu, M., Dayal, U.: K-harmonic means. In: International Workshop on Temporal, Spatial and spatio-Temporal Data Mining, TSDM 2000, Lyon, France (September 12, 2000)
Cui, X., Potok, T.E.: Document clustering using particle swarm optimization. In: IEEE Swarm Intelligence Symposium, Pasadena, California (2005)
Kao, Y.T., Zahara, E., Kao, I.W.: A hybridized approach to data clustering. Expert Systems with Applications 34(3), 1754–1762 (2008)
Karegowda, A.G., Manjunath, A.S., Jayaram, M.A.: Application of Genetic Algorithm Optimized Neural Network Connection Weights For Medical Diagnosis of Pima Indians Diabetes. International Journal of Computer Applications (0975 – 8887) 43(1) (April 2012)
Painho, M., Bação, F.: Using Genetic Algorithms in Clustering Problems. In: GeoComputation. Higher Institute of Statistics and Information Management, New University of Lisbon, Travessa, EstêvãoPinto (Campolide) P-1070-124 Lisboa (2000)
Krishna, K., Murty, M.N.: Genetic k-means algorithm. IEEE Transactions on Systems, Man and Cybernetics B Cybernetics 29, 433–439 (1999)
Yin, M., Hu, Y., Yang, F., Li, X., Gu, W.: A novel hybrid K-harmonic means and gravitational search algorithm approach for clustering. Expert Systems with Applications 38, 9319–9324 (2011)
Yang, F., Sun, T., Zhang, C.: An efficient hybrid data clustering method based on K-harmonic means and Particle Swarm Optimization (2009)
Song, W., Park, S.C.: An Improved GA for Document Clustering with semantic Similarity measure. In: Fourth International IEEE Conference on Natural Computation, pp. 536–540 (2008)
Zhang, B., Hsu, M.: K-Harmonic Means - A Data Clustering Algorithm Umeshwar Dayal Software Technology Laboratory HP Laboratories, Palo Alto HPL-1999-124 (October 1999)
He, H., Tan, Y.: A two-stage genetic algorithm for automatic clustering. Neurocomputing 81, 49–59 (2012)
Jiang, H., Yi, S., Li, J., Yang, F., Hu, X.: Ant clustering algorithm with K-harmonic means clustering (2010)
Hung, C.H., Chiou, H.-M., Yang, W.-N.: Candidate groups search for K-harmonic means data clustering (2013)
Alguwaizani, A., Hansen, P., Mladenovic, N., Ngai, E.: Variable neig
Everitt, B.S., Landau, S., Leese, M., Stahl, D.: Cluster Analysis. Willey series in probability and statistics
Sahaa, S., Bandyopadhyay, S.: A generalized automatic clustering algorithm in a multiobjective framework. Applied Soft Computing 13, 89–108 (2013)
Handl, J., Knowles, J.: An evolutionary approach to multiobjective clustering. IEEE Transaction on Evolutionary Computation 11(1), 56–76 (2007)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Thakare, A.D., Dhote, C.A. (2015). A Two-Stage Genetic K-harmonic Means Method for Data Clustering. In: El-Alfy, ES., Thampi, S., Takagi, H., Piramuthu, S., Hanne, T. (eds) Advances in Intelligent Informatics. Advances in Intelligent Systems and Computing, vol 320. Springer, Cham. https://doi.org/10.1007/978-3-319-11218-3_36
Download citation
DOI: https://doi.org/10.1007/978-3-319-11218-3_36
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11217-6
Online ISBN: 978-3-319-11218-3
eBook Packages: EngineeringEngineering (R0)