Abstract
In this work we introduce a modification to an automatic non-supervised rule to select the parameters of a previously presented graph-based metric. This rule maximizes a clustering quality index providing the best possible solution from a clustering quality point of view. We apply our parameter-free PKNNG metric on gene expression data to show that the best quality solutions are also the ones that are more related to the biological classes. Finally, we compare our parameter-free metric with a group of state-of-the-art clustering algorithms. Our results indicate that our parameter-free metric performs as well as the state-of-the-art clustering methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Golub, T.R., Slonim, D.K., et al.: Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science 286(5439), 531–537 (1999)
Alizadeh, A.A., Eisen, M.B., et al.: Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling. Nature 403(6769), 503–511 (2000)
Bayá, A.E., Granitto, P.M.: Clustering gene expression data with a penalized graph-based metric. BMC Bioinformatics 12, 2 (2011), http://www.biomedcentral.com/1471-2105/12/2
Bayá, A.E., Granitto, P.M.: Improved graph-based metrics for clustering high-dimensional datasets. In: Kuri-Morales, A., Simari, G.R. (eds.) IBERAMIA 2010. LNCS (LNAI), vol. 6433, pp. 184–193. Springer, Heidelberg (2010)
Ng, A., Jordan, M., Weiss, Y.: On spectral clustering: Analysis and an algorithm, Vancouver, British Columbia, Canada, December 3-8. Advances in Neural Information Processing Systems, vol. 14, pp. 849–856. MIT Press, Cambridge (2001)
Fred, A., Jain, A.K.: Combining Multiple Clusterings Using Evidence Accumulation. IEEE Trans. Pattern Anal. Mach. Intell. 27(6), 835–850 (2005)
Kim, E.-K., Kim, S.-Y., et al.: MULTI-K: accurate classification of microarray subtypes using ensemble k-means clustering. BMC Bioinformatics 10(1) (2009)
Yeung, K., Fraley, C., et al.: Model-based clustering and data transformations for gene expression data. Bioinformatics 17(10), 977–987 (2001)
Thalamuthu, A., Mukhopadhyay, I., et al.: Evaluation and comparison of gene clustering methods in microarray analysis. Bioinformatics 22(19), 2405 (2006)
Rousseeuw, P.: Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 20, 53–65 (1987)
Kaufman, L., Rousseeuw, P.: Finding Groups in Data: An Introduction to Cluster Analysis. John Wiley & Sons, New York (1990)
Monti, S., Tamayo, P., et al.: Consensus Clustering: A Resampling-Based Method for Class Discovery and Visualization of Gene Expression Microarray Data. Machine Learning 52(1-2), 91–118 (2003)
Yeoh, E.-J., Ross, M.E., Shurtleff, S.A., et al.: Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling. Cancer Cell 1, 133–143 (2002)
Su, A.I., Cooke, M.P., et al.: Large-scale Analysis of the Human and Mouse Transcriptomes. Proceedings of the National Academy of Sciences USA 99, 4465–4470 (2002)
Pomeroy, S., Tamayo, P., et al.: Gene Expression-Based Classification and Outcome Prediction of Central Nervous System Embryonal Tumors. Nature 415, 436–442 (2002)
Eisen, M.B., Spellman, P.T., et al.: Cluster analysis and display of genome-wide expression patterns. Proceedings of the National Academy of Sciences USA 95, 14863–14868 (1998)
Milligan, G., Cooper, M.: A study of comparability of external criteria for hierarchical cluster analysis. Multivariate Behavioral Research 21, 441–458 (1986)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Bayá, A.E., Granitto, P.M. (2011). Improved Gene Expression Clustering with the Parameter-Free PKNNG Metric. In: Norberto de Souza, O., Telles, G.P., Palakal, M. (eds) Advances in Bioinformatics and Computational Biology. BSB 2011. Lecture Notes in Computer Science(), vol 6832. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22825-4_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-22825-4_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22824-7
Online ISBN: 978-3-642-22825-4
eBook Packages: Computer ScienceComputer Science (R0)