Abstract
We consider the choice of clustering criteria for use in multiobjective data clustering. We evaluate four different pairs of criteria, three employed in recent evolutionary algorithms for multiobjective clustering, and one from Delattre and Hansen’s seminal exact bicriterion method. The criteria pairs are tested here within a single multiobjective evolutionary algorithm and representation scheme to isolate their effects from other considerations. Results on a range of data sets reveal significant performance differences, which can be understood in relation to certain types of challenging cluster structure, and the mathematical form of the criteria. A performance advantage is generally found for those methods that make limited use of cluster centroids and assess partitionings based on aggregate measures of the location of all data points.
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bandyopadhyay, S., Mukhopadhyay, A., Maulik, U.: An improved algorithm for clustering gene expression data. Bioinformatics 23(21), 2859–2865 (2007)
Caballero, R., Laguna, M., Martí, R., Molina, J.: Scatter tabu search for multiobjective clustering problems. Journal of the Operational Research Society 62(11), 2034–2046 (2010)
Corne, D., Jerram, N., Knowles, J., Oates, M.: PESA-II: Region-based selection in evolutionary multiobjective optimization. In: Proceedings of the Genetic and Evolutionary Computation Conference, GECCO 2001, pp. 283–290. Morgan Kaufmann Publishers (2001)
Delattre, M., Hansen, P.: Bicriterion cluster analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence 2(4), 277–291 (1980)
Halkidi, M., Batistakis, Y., Vazirgiannis, M.: On clustering validation techniques. Journal of Intelligent Information Systems 17(2), 107–145 (2001)
Handl, J., Knowles, J.: http://personalpages.manchester.ac.uk/mbs/julia.handl/moc.html
Handl, J., Knowles, J.: Exploiting the Trade-off — The Benefits of Multiple Objectives in Data Clustering. In: Coello Coello, C.A., Hernández Aguirre, A., Zitzler, E. (eds.) EMO 2005. LNCS, vol. 3410, pp. 547–560. Springer, Heidelberg (2005)
Handl, J., Knowles, J.: An evolutionary approach to multiobjective clustering. IEEE Transactions on Evolutionary Computation 11(1), 56–76 (2007)
Handl, J., Knowles, J., Kell, D.B.: Computational cluster validation for post-genomic data analysis. Bioinformatics 21(15), 3201–3212 (2005)
Hansen, P., Delattre, M.: Complete-link cluster analysis by graph coloring. Journal of the American Statistical Association, 397–403 (1978)
Jain, A., Murty, M., Flynn, P.: Data clustering: a review. ACM Computing Surveys 31(3), 264–323 (1999)
Mukhopadhyay, A., Maulik, U., Bandyopadhyay, S.: Multiobjective genetic algorithm-based fuzzy clustering of categorical attributes. IEEE Transactions on Evolutionary Computation 13(5), 991–1005 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Handl, J., Knowles, J. (2012). Clustering Criteria in Multiobjective Data Clustering. In: Coello, C.A.C., Cutello, V., Deb, K., Forrest, S., Nicosia, G., Pavone, M. (eds) Parallel Problem Solving from Nature - PPSN XII. PPSN 2012. Lecture Notes in Computer Science, vol 7492. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-32964-7_4
Download citation
DOI: https://doi.org/10.1007/978-3-642-32964-7_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-32963-0
Online ISBN: 978-3-642-32964-7
eBook Packages: Computer ScienceComputer Science (R0)