Abstract
Clustering analysis of time series data from DNA microarray hybridization studies is essential for identifying biological relevant groups of genes. Microarrrays provide large datasets that are currently primarily analyzed using crisp clustering techniques. Crisp clustering methods such as K-means or self organizing maps assign each gene to one cluster, thus omitting information concerning the multiple roles of genes. One of the major advantages of fuzzy clustering is that genes can belong to more than one group, revealing this way more profound information concerning the function and regulation of each gene. Additionally, recent studies have proven that integrating a small amount of information in purely unsupervised algorithms leads to much better performance. In this paper we propose a new semi-supervised fuzzy clustering algorithm which we apply in time series gene expression data. The clustering that was performed on simulated as well as experimental microarray data proved that the proposed method outperformed other clustering techniques.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Eisen, M., Spellman, P., Brown, P., Botstein, D.: Cluster analysis and display of genome-wide expression patterns. Proc. Nat. Acad. Sci. 95, 14863–14868 (1998)
Liew, A.W.-C., Yan, H., Yang, M.: Pattern recognition techniques for the emerging field of bioinformatics. Pattern Recognition 38, 2055–2073 (2005)
Guthke, R., Moller, U., Hoffmann, M., et al.: Dynamic network reconstruction from gene expression data applied to immune response during bacteria infection. Bioinformatics 21, 1626–1634 (2005)
Tsao, E., Bezdek, J., Pal, N.: Fuzzy Kohonen clustering networks. Pattern Recognition 27, 757–764 (1994)
Asyali, M.H., Alci, M.: Reliability analysis of microarray data using fuzzy c-means and normal mixture modeling based classification methods. Bioinformatics 21, 644–649 (2005)
Belacel, N., et al.: Fuzzy J-Means and VNS methods for clustering genes from microarray data. Bioinformatics 20, 1690–1701 (2004)
Dembele, D., Kastner, P.: Fuzzy C-means method for clustering microarray data. Bioinformatics 19, 973–980 (2003)
Bezdek, J.C.: Pattern Recognition with Fuzzy Objective Function Algorithms. Plenum Press, New York (1981)
Kohonen, T.: Self-Organized Maps, 2nd edn. Springer, Heidelberg (1997)
Pal, N.R., Bezdek, J.C.: On Cluster Validity for the Fuzzy C-means model. IEEE Transactions on Fuzzy Systems 3, 370–379 (1995)
Möller-Levet, C.S., Klawonn, F., Cho, K.-H., Yin, H., Wolkenhauer, O.: Clustering of unevenly sampled Gene Expression Time-Series Data. Fuzzy Sets and Systems 152, 49–66 (2005)
Schliep, A., Costa, I.G., Schonhuth, A.: Analyzing Gene Expression Time- Courses. IEEE/ACM Transactions on Computational Biology and Bioinformatics 2, 179–193 (2005)
Cho, R.J., Campbell, M.J., Winzeler, E.A., Steinmetz, L., Conway, A., Wodicka, L., Wolfsberg, T.G., Gabrielian, A. E., Landsman, D., Lockhart, D. J., Davis, R. W.: A genome-wide transcriptional analysis of the mitotic cell cycle. Molecular Cell 2, 65–73 (1998)
Filkov, V., Skiena, S., Zhi, I.: Analysis Techniques for Microarray Time-Series Data. Journal of Computational Biology 9, 317–330 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Maraziotis, I.A., Dragomir, A., Bezerianos, A. (2006). Semi Supervised Fuzzy Clustering Networks for Constrained Analysis of Time-Series Gene Expression Data. In: Kollias, S., Stafylopatis, A., Duch, W., Oja, E. (eds) Artificial Neural Networks – ICANN 2006. ICANN 2006. Lecture Notes in Computer Science, vol 4132. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11840930_85
Download citation
DOI: https://doi.org/10.1007/11840930_85
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-38871-5
Online ISBN: 978-3-540-38873-9
eBook Packages: Computer ScienceComputer Science (R0)