A semi-supervised hierarchical approach: two-dimensional clustering of microarray gene expression data

Priscilla, R.; Swamynathan, S.

doi:10.1007/s11704-013-1076-z

A semi-supervised hierarchical approach: two-dimensional clustering of microarray gene expression data

Research Article
Published: 13 March 2013

Volume 7, pages 204–213, (2013)
Cite this article

Frontiers of Computer Science Aims and scope Submit manuscript

R. Priscilla¹ &
S. Swamynathan¹

266 Accesses
6 Citations
Explore all metrics

Abstract

Micro array technologies have become a widespread research technique for biomedical researchers to assess tens of thousands of gene expression values simultaneously in a single experiment. Micro array data analysis for biological discovery requires computational tools. In this research a novel two-dimensional hierarchical clustering is presented. From the review, it is evident that the previous research works have used clustering which have been applied in gene expression data to create only one cluster for a gene that leads to biological complexity. This is mainly because of the nature of proteins and their interactions. Since proteins normally interact with different groups of proteins in order to serve different biological roles, the genes that produce these proteins are therefore expected to co express with more than one group of genes. This constructs that in micro array gene expression data, a gene may makes its presence in more than one cluster. In this research, multi-level micro array clustering, performed in two dimensions by the proposed two-dimensional hierarchical clustering technique can be used to represent the existence of genes in one or more clusters consistent with the nature of the gene and its attributes and prevent biological complexities.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Clustering Techniques from Significance Analysis of Microarrays

Clustering-Based Techniques for Big Data Analysis of Gene Expression

A Comparative Analysis of Clustering and Biclustering Algorithms in Gene Analysis

References

Liang J, Kachalo S. Computational analysis of microarray gene expression profiles: clustering, classification, and beyond. Chemometrics and Intelligent Laboratory Systems, 2002, 62(2): 199–216
Article Google Scholar
Khlopova N S, Glazko V I, Glazko T T. Differentiation of gene expression profiles data for liver and kidney of pigs. World Academy of Science, Engineering and Technology, 2009, 31: 263–266
Google Scholar
Sarmah C, Samarasinghe S, Kulasiri D, Catchpoole D. A simple affymetrix ratio-transformation method yields comparable expression level quantifications with cdna data. International Journal of Biological and Life Sciences, 2012, 8(3): 157–162
Google Scholar
Gruzdz A, Ihnatowicz A, Siddiqi J, Akhgar B. Mining genes relations in microarray data combined with ontology in colon cancer automated diagnosis system. World Academy of Science, Engineering and Technology, 2008(16): 920–928
Google Scholar
Cvek U, Trutschl M, Randolph Stone I, Syed Z, Clifford J, Sabichi A. Multidimensional visualization tools for analysis of expression data. World Academy of Science, Engineering and Technology, 2009(30): 281–289
Google Scholar
Kim S, Choi T, Bae J. Fuzzy types clustering for microarray data. International Journal of Computational Intelligence, 2006, 2(1): 12–15
Google Scholar
Wu X, Chen Y, Brooks B, Su Y. The local maximum clustering method and its application in microarray gene expression data analysis. EURASIP Journal on Advances in Signal Processing, 1900, 2004(1): 53–63
Article Google Scholar
Kim S, Lee J, Bae J. Iterative clustering algorithm for analyzing temporal patterns of gene expression. World Academy of Science, Engineering and Technology, 2007(4): 502–505
Google Scholar
Chen G, Jaradat S, Banerjee N, Tanaka T, Ko M, Zhang M. Evaluation and comparison of clustering algorithms in analyzing ES cell gene expression data. Statistica Sinica, 2002, 12(1): 241–262
MathSciNet MATH Google Scholar
Qin Z. Clustering microarray gene expression data using weighted chinese restaurant process. Bioinformatics, 2006, 22(16): 1988–1997
Article Google Scholar
Lee M, Kim Y, Kim Y, Lee Y, Yoon H. An ant-based clustering system for knowledge discovery in DNA chip analysis data. In: Proceedings of World Academy of Science, Engineering and Technology. 2007, (5): 261–266
Google Scholar
Wang R, Scharenbroich L, Hart C, Wold B, Mjolsness E. Clustering analysis of microarray gene expression data by splitting algorithm. Journal of Parallel and Distributed Computing, 2003, 63(7): 692–706
Article Google Scholar
Kalocsai P, Shams S. Visualization and analysis of gene expression data. Journal of the Association for Laboratory Automation, 1999, 4(5): 58–61
Article Google Scholar
Van Der Laan M, Pollard K. A new algorithm for hybrid clustering of gene expression data with visualization and the bootstrap. Journal of Statistical Planning and Inference, 2003, 117: 275–303
Article MathSciNet MATH Google Scholar
Do J, Choi D, others. Clustering approaches to identifying gene expression patterns from DNA microarray data. Molecules and Cells, 2008, 25(2): 279
Google Scholar
Trepalin S, Yarkov A. Hierarchical clustering of large databases and classification of antibiotics at high noise levels. Algorithms, 2008, 1(2): 183–200
Article Google Scholar
Tuncbag N, Haliloglu T, Keskin O. Correspondence between function and interaction in protein interaction network of saccaromyces cerevisiae. International Journal of Biological and Medical Sciences, 2006, 1(3): 167–174
Google Scholar
Kim S, Hamasaki T. Evaluation of clustering based on preprocessing in gene expression data. International Journal of Biological, Biomedical and Medical Sciences, 2008, 3(1): 48–53
Google Scholar
Valarmathie P, Srinath M, Ravichandran T, Dinakaran K. Hybrid fuzzy C-means clustering technique for gene expression data. International Journal of Research and Reviews in Applied Sciences, 2009, 1(1): 33–37
Google Scholar
Dey L, Mukhopadhyay A. Microarray gene expression data clustering using PSO based K-means algorithm. UACEE International Journal of Computer Science and its Applications, 2009, 1(1): 232–236
Google Scholar
Mar J, Wells C, Quackenbush J. Defining an informativeness metric for clustering gene expression data. Bioinformatics, 2011, 27(8): 1094–1100
Article Google Scholar
Jing L, Ng M, Zeng T. Novel hybrid method for gene selection and cancer prediction. World Academy of Science, Engineering and Technology, 2010 (38): 482–489
Google Scholar
ALL/AML datasets. http://www.broadinstitute.org/cancer/software/genepattern/datasets/
Larsen B, Aone C. Fast and effective text mining using linear-time document clustering. In: Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining. 1999, 16–22
Google Scholar
Steinbach M, Karypis G, Kumar V, others. A comparison of document clustering techniques. In: KDD Workshop on Text Mining. 2000, 525–526
Google Scholar
Yin X, Chen S, Hu E, Zhang D. Semi-supervised clustering with metric learning: an adaptive kernel method. Pattern Recognition, 2010, 43(4): 1320–1333
Article MATH Google Scholar
Alfred R. Summarizing relational data using semi-supervised genetic algorithm-based clustering techniques. Journal of Computer Science, 2010, 6(7): 775–784
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Information and Science and Technology, The College of Engineering, Guindy Campus, Anna University, Chennai, 600025, India
R. Priscilla & S. Swamynathan

Authors

R. Priscilla
View author publications
You can also search for this author inPubMed Google Scholar
S. Swamynathan
View author publications
You can also search for this author inPubMed Google Scholar

Corresponding author

Correspondence to R. Priscilla.

Additional information

R Priscilla is a PhD research scholar in the Department of Information Science and Technology, Anna University, India. She is currently an associate professor at the Department of Information Technology, St. Josephs Institute of Technology, Chennai with the teaching experience of 10 years. Her major research interests include advanced database technology, Web services, data mining, and bioinformatics.

S Swamynathan received his PhD in Computer Science and Engineering in Anna University, India. He is currently an associate professor at the Department of Information Science and Technology, Anna University, Chennai with the teaching experience of 15 years. His current research interests include advanced database technology, distributed systems, artificial intelligence, XML and Web services, semantic Web services, and bioinformatics.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Priscilla, R., Swamynathan, S. A semi-supervised hierarchical approach: two-dimensional clustering of microarray gene expression data. Front. Comput. Sci. 7, 204–213 (2013). https://doi.org/10.1007/s11704-013-1076-z

Download citation

Received: 20 June 2011
Accepted: 16 August 2012
Published: 13 March 2013
Issue Date: April 2013
DOI: https://doi.org/10.1007/s11704-013-1076-z

Keywords

Access this article

Log in via an institution

Subscribe and save

Springer+ Basic

$34.99 /Month

Get 10 units per month
Download Article/Chapter or eBook
1 Unit = 1 Article or 1 Chapter
Cancel anytime

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A semi-supervised hierarchical approach: two-dimensional clustering of microarray gene expression data

Abstract

Access this article

Subscribe and save

Buy Now

Similar content being viewed by others

Clustering Techniques from Significance Analysis of Microarrays

Clustering-Based Techniques for Big Data Analysis of Gene Expression

A Comparative Analysis of Clustering and Biclustering Algorithms in Gene Analysis

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Subscribe and save

Buy Now