An Evolutionary Approach for Sample-Based Clustering on Microarray Data

Glez-Peña, Daniel; Díaz, Fernando; Méndez, José R.; Corchado, Juan M.; Fdez-Riverola, Florentino

doi:10.1007/978-3-642-02481-8_148

Daniel Glez-Peña²³,
Fernando Díaz²⁴,
José R. Méndez²³,
Juan M. Corchado²⁵ &
…
Florentino Fdez-Riverola²³

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5518))

Included in the following conference series:

International Work-Conference on Artificial Neural Networks

2647 Accesses

Abstract

Sample-based clustering is one of the most common methods for discovering disease subtypes as well as unknown taxonomies. By revealing hidden structures in microarray data, cluster analysis can potentially lead to more tailored therapies for patients as well as better diagnostic procedures. In this work, we present a novel method for automatically discovering clusters of samples which are coherent from a genetic point of view. Each possible cluster is characterized by a fuzzy pattern which maintains a fuzzy discretization of relevant gene expression values. Noise genes are identified and removed from the fuzzy pattern based on their probability of appearance. Possible clusters are randomly constructed and iteratively refined by following a probabilistic search and an optimization schema. Experimental results over publicly available microarray data show the effectiveness of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 129.00; Price excludes VAT (USA)

Softcover Book: USD 169.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Xing, E.P., Karp, R.M.: Cliff: Clustering of high-dimensional microarray data via iterative feature filtering using normalized cuts. Bioinformatics 17(1), 306–315 (2001)
Article Google Scholar
Jiang, D., Tang, C., Zhang, A.: Cluster Analysis for Gene Expression Data: A Survey. IEEE Transactions on Knowledge and Data Engineering 16(11), 1370–1386 (2004)
Article Google Scholar
Alter, O., Brown, P.O., Bostein, D.: Singular value decomposition for genome-wide expression data processing and modeling. Proceedings of the National Academy of Sciences of the United States of America 97(18), 10101–10106 (2000)
Article Google Scholar
Ding, C.: Analysis of gene expression profiles: class discovery and leaf ordering. In: Proceedings of the Six Annual International Conference on Computational Molecular Biology, pp. 127–136 (2002)
Google Scholar
Yeung, K.Y., Ruzzo, W.L.: Principal component analysis for clustering gene expression data. Oxford Bioinformatics 17(9), 763–774 (2000)
Article Google Scholar
Ben-Dor, A., Friedman, N., Yakhini, Z.: Class discovery in gene expression data. In: Proceedings of the fifth Annual International Conference on Computational Biology, pp. 31–38 (2001)
Google Scholar
Xing, E.P., Karp, R.M.: Cliff: Clustering of high-dimensional microarray data via iterative feature filtering using normalized cuts. Oxford Bioinformatics 17(1), 306–315 (2001)
Article Google Scholar
von Heydebreck, A., Huber, W., Poustka, A., Vingron, M.: Identifying splits with clear separation: a new class discovery method for gene expression data. Oxford Bioinformatics 17, 107–114 (2001)
Article Google Scholar
Tang, C., Zhang, A., Ramanathan, M.: ESPD: a pattern detection model underlying gene expression profiles. Oxford Bioinformatics 20(6), 829–838 (2004)
Article Google Scholar
Varma, S., Simon, R.: Iterative class discovery and feature selection using Minimal Spanning Trees. BMC Bioinformatics 5, 126 (2004)
Article Google Scholar
Glez-Peña, D., Álvarez, R., Díaz, F., Fdez-Riverola, F.: DFP: A Bioconductor package for fuzzy profile identification and gene reduction of microarray data. BMC Bioinformatics 10, 37 (2009)
Article Google Scholar
Armstrong, S.A., Stauton, J.E., Silverman, L.B., Pieters, R., den Boer, M.L., Minden, M.D., Sallan, S.E., Lander, E.S., Golub, T.R., Korsmeyer, S.J.: MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia. Nature Genetics 20, 41–47 (2002)
Article Google Scholar

Download references

Author information

Authors and Affiliations

ESEI: Escuela Superior de Ingeniería Informática, University of Vigo, Edificio Politécnico, Campus Universitario As Lagoas s/n, 32004, Ourense, Spain
Daniel Glez-Peña, José R. Méndez & Florentino Fdez-Riverola
Dept. Informática, University of Valladolid, Escuela Universitaria de Informática, Plaza Santa Eulalia, 9-11, 40005, Segovia, Spain
Fernando Díaz
Dept. Informática y Automática, University of Salamanca, Plaza de la Merced s/n, 37008, Salamanca, Spain
Juan M. Corchado

Authors

Daniel Glez-Peña
View author publications
You can also search for this author in PubMed Google Scholar
Fernando Díaz
View author publications
You can also search for this author in PubMed Google Scholar
José R. Méndez
View author publications
You can also search for this author in PubMed Google Scholar
Juan M. Corchado
View author publications
You can also search for this author in PubMed Google Scholar
Florentino Fdez-Riverola
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Graduate School of Engineering, Osaka Prefecture University, Osaka, Japan
Sigeru Omatu
Department of Informatics / CCTC, University of Minho, Braga, Portugal
Miguel P. Rocha
MAmI Research Lab, University of Castilla-La Mancha,, Ciudad Real, Spain
José Bravo
Department of Informatics, University of Vigo, Ourense, Spain
Florentino Fernández
Grupo de Investigación GICAP, Área de Lenguajes Higher Polytechnic School, Universidad de Burgos, Burgos, Spain
Emilio Corchado
Higher Polytechnic School, University of Burgos, Burgos, Spain
Andrés Bustillo
Department of Informatics, University of Salamanca, Salamanca, Spain
Juan M. Corchado

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Glez-Peña, D., Díaz, F., Méndez, J.R., Corchado, J.M., Fdez-Riverola, F. (2009). An Evolutionary Approach for Sample-Based Clustering on Microarray Data. In: Omatu, S., et al. Distributed Computing, Artificial Intelligence, Bioinformatics, Soft Computing, and Ambient Assisted Living. IWANN 2009. Lecture Notes in Computer Science, vol 5518. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02481-8_148

Download citation

DOI: https://doi.org/10.1007/978-3-642-02481-8_148
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02480-1
Online ISBN: 978-3-642-02481-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics