Abstract
Clustering technique is one of the useful tools to elucidate similar patterns across large number of transcripts and to identify likely co-regulated genes. It attempts to partition the genes into groups exhibiting similar patterns of variation in expression level. An application of rough-fuzzy c-means (RFCM) algorithm is presented in this paper to discover co-expressed gene clusters. Selection of initial prototypes of different clusters is one of the major issues of the RFCM based microarray data clustering. The pearson correlation based initialization method is used to address this limitation. It enables the RFCM algorithm to discover co-expressed gene clusters. The effectiveness of the RFCM algorithm and the initialization method, along with a comparison with other related methods, is demonstrated on five yeast gene expression data sets using standard cluster validity indices and gene ontology based analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Boyle, E.I., Weng, S., Gollub, J., Jin, H., Botstein, D., Cherry, J.M., Sherlock, G.: GO: Term Finder Open Source Software for Accessing Gene Ontology Information and Finding Significantly Enriched Gene Ontology Terms Associated with a List of Genes. Bioinformatics 20(18), 3710–3715 (2004)
Dembele, D., Kastner, P.: Fuzzy C-Means Method for Clustering Microarray Data. Bioinformatics 19(8), 973–980 (2003)
Domany, E.: Cluster Analysis of Gene Expression Data. Journal of Statistical Physics 110, 1117–1139 (2003)
Maji, P., Pal, S.K.: Rough Set Based Generalized Fuzzy C-Means Algorithm and Quantitative Indices. IEEE Transactions on System, Man, and Cybernetics, Part B: Cybernetics 37(6), 1529–1540 (2007)
Maji, P., Paul, S.: Microarray Time-Series Data Clustering Using Rough-Fuzzy C-Means Algorithm. In: Proceedings of 5th IEEE International Conference on Bioinformatics and Biomedicine, pp. 1–4 (2011)
Shamir, R., Sharan, R.: CLICK: A Clustering Algorithm for Gene Expression Analysis. In: Proceedings of the 8th International Conference on Intelligent Systems for Molecular Biology (2000)
Theodoridis, S., Koutroumbas, K.: Pattern Recognition, 4th edn. Elsevier (2009)
Wang, J., Delabie, J., Aasheim, H.C., Smeland, E., Myklebost, O.: Clustering of the SOM Easily Reveals Distinct Gene Expression Patterns: Results of a Reanalysis of Lymphoma Study. BMC Bioinformatics 3(36), 1–9 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Maji, P., Paul, S. (2012). Rough-Fuzzy C-Means for Clustering Microarray Gene Expression Data. In: Kundu, M.K., Mitra, S., Mazumdar, D., Pal, S.K. (eds) Perception and Machine Intelligence. PerMIn 2012. Lecture Notes in Computer Science, vol 7143. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-27387-2_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-27387-2_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-27386-5
Online ISBN: 978-3-642-27387-2
eBook Packages: Computer ScienceComputer Science (R0)