Abstract
A challenge in partitional clustering is determining the number of clusters that best characterize a set of observations. In this paper, we present a novel approach for determining both an optimal number of clusters and partitioning of the data set. Our new algorithm is based on cooperative coevolution and inspired by the natural process of sympatric speciation. We have evaluated our algorithm on a number of synthetic and real data sets from pattern recognition literature and on a recently-collected set of epigenetic data consisting of DNA methylation levels. In a comparison with a state-of-the-art algorithm that uses a variable string-length GA for clustering, our algorithm demonstrated a significant performance advantage, both in terms of determining an appropriate number of clusters and in the quality of the cluster assignments as reflected by the misclassification rate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
MacQueen, J.: Some methods for classification and analysis of multivariate observations. In: 5th Berkeley Symp. on Mathematical Statistics and Probability, pp. 281–297. Univ. of Calif. Press, Berkeley (1967)
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Kluwer Academic Publishers, Dordrecht (1981)
Beringer, J., Hüllermeier, E.: Adaptive optimization of the number of clusters in fuzzy clustering. In: IEEE Int. Conf. on Fuzzy Systems. IEEE, Los Alamitos (2007)
Potter, M.A., De Jong, K.A.: Cooperative coevolution: An architecture for evolving coadapted subcomponents. Evolutionary Computation 8(1), 1–29 (2000)
Maulik, U., Bandyopadhyay, S.: Fuzzy partitioning using real-coded variable-length genetic algorithm for pixel classification. IEEE Trans. on Geoscience and Remote Sensing 41(5), 1075–1081 (2003)
Pakhira, M.K., Bandyopadhyay, S., Maulik, U.: A study of some fuzzy cluster validity indices, genetic clustering and application to pixel classification. Fuzzy Sets and Systems 155, 191–214 (2005)
Murthy, C.A., Chowdhury, N.: In search of optimal clusters using genetic algorithms. Pattern Recognition Letters 17, 825–832 (1996)
Raghavan, V.V., Birchand, K.: A clustering strategy based on a formalism of the reproductive process in a natural system. In: 2nd Annual International ACM SIGIR Conf. on Information Storage and Retrieval, pp. 10–22. ACM, New York (1979)
Babu, G.P., Murty, M.N.: Clustering with evolution strategies. Pattern Recognition 27(2), 321–329 (1994)
Scheunders, P.: A genetic c-means clustering algorithm applied to color image quantization. Pattern Recognition 30(6), 859–866 (1997)
Hall, L.O., Ozyurt, B., Bezdek, J.C.: Clustering with a genetically optimized approach. IEEE Trans. on Evolutionary Computation 3(2), 103–112 (1999)
Maulik, U., Bandyopadhyay, S.: Genetic algorithm-based clustering technique. Pattern Recognition 33, 1455–1465 (2000)
Bandyopadhyay, S., Maulik, U., Mukhopadhyay, A.: Multiobjective genetic clustering for pixel classification in remote sensing imagery. IEEE Trans. on Geoscience and Remote Sensing 45(5), 1506–1511 (2007)
Bandyopadhyay, S., Saha, S.: GAPS: A new symmetry based genetic clustering technique. Pattern Recognition 40, 3430–3451 (2007)
Lee, C.Y.: Efficient Automatic Engineering Design Synthesis via Evolutionary Exploration. PhD thesis, Calif. Institute of Technology, Pasadena, Calif. (2002)
Bandyopadhyay, S., Maulik, U.: Genetic clustering for automatic evolution of clusters and application to image classification. Pattern Recognition 35, 1197–1208 (2002)
Kharma, N., Suen, C.Y., Guo, P.F.: PalmPrints: A novel co-evolutionary algorithm for clustering finger images. In: GECCO, pp. 322–331. Springer, Heidelberg (2003)
Xie, X.L., Beni, G.: A validly measure for fuzzy clustering. IEEE Trans. on Pattern Analysis and Machine Intelligence 13(8), 841–847 (1991)
Pakhira, M.K., Bandyopadhyay, S., Maulik, U.: Validity index for crisp and fuzzy clusters. Pattern Recognition 37, 487–501 (2004)
Asuncion, A., Newman, D.: UCI machine learning repository (2007)
Ehrich, M., Nelson, M.R., Stanssens, P., Zabeau, M., Liloglou, T., Xinarianos, G., Cantor, C.R., Field, J.K., van den Boom, D.: Quantitative high-throughput analysis of dna methylation patterns by base-specific cleavage and mass spectrometry. Proceedings of the National Academy of Sciences USA 102(44), 15785–15790 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Potter, M.A., Couldrey, C. (2010). A Cooperative Coevolutionary Approach to Partitional Clustering. In: Schaefer, R., Cotta, C., Kołodziej, J., Rudolph, G. (eds) Parallel Problem Solving from Nature, PPSN XI. PPSN 2010. Lecture Notes in Computer Science, vol 6238. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15844-5_38
Download citation
DOI: https://doi.org/10.1007/978-3-642-15844-5_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15843-8
Online ISBN: 978-3-642-15844-5
eBook Packages: Computer ScienceComputer Science (R0)